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THEME 


The  theme  of  this  Specialists’  Meeting  was  two-fold.  It  was  first  intended  to  make  a 
retrospective  judgement  of  significant  developments  which  have  occurred  in  the  aerospace 
and  defence  information  field,  in  particular,  and  the  information  science  field  in  general, 
over  the  past  two  decades.  Specific  problems  which  to  date  remain  unsolved  (such  as 
automatic  indexing,  fact  retrieval,  and  input  standardization)  were  reviewed.  Then,  the 
Meeting  looked  ahead  to  the  ‘80’s,  speculating  as  to  the  technical  and  sociological  changes 
which  might  occur,  and  trying  to  assess  what  will  be  the  impact  of  these  changes  upon  the 
information  services  available  towards  the  end  of  the  decade.  The  programme  concluded 
with  a  user-supplier  dialogue  in  the  context  of  a  general  Forum  Discussion. 


THEME 


Cette  reunion  de  Sp6cialistes  s’est  organisie  autour  d’un  double  theme.  11  fut  tout 
d’abord  precede  a  un  jugement  rftrospectif  des  ddveloppements  importants  intervenus 
au  cours  des  deux  demises  decades,  en  particular  dans  le  domaine  de  l’information 
touchant  la  defense  et  les  sciences  alrospatiales,  et,  en  general,  dans  le  domaine  de  la 
science  de  l’information.  Certains  probtemes  sp4cifiques  n’ayant  pas  encore  trouvi  leur 
solution  (comme  l’indexation  automatique,  l’extraction  des  faits  et  la  standardisation  des 
entries)  furent  passes  en  revue.  Puis  la  rtunion  se  toumait  vers  1’horizon  80,  se  livrait 
a  des  speculations  sur  les  modifications  techniques  et  sociologiques  susceptibles  de  se 
produire,  et  s’efforcait  d’ivaluer  I’impact  de  ces  modifications  sur  les  services  informatiques 
qui  existeront  vers  la  fin  de  la  decade.  Le  programme  s’achivait  par  un  dialogue  entre 
foumisseur  et  usager  dans  le  contexte  d’un  forum  general. 
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CHANGING  INFORMATION  SCENES  FROM  THE  TIP  SPECIALISTS’ 
MEETING  1968  IN  MUNICH  TO  THE  EIGHTIES 

Mr  R  K  H  BREE 
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5330  Kfinigswinter  41 
Fed  Rep  of  Germany 


INTRODUCTORY  REMARKS 

The  Programme  Committee  of  AGARD  TIP  has  had  the  courage  to  invite  me  to  present  the 
first  Paper  of  this  year's  meeting.  This  means  inviting  a  colleague  who  retired  from 
his  professional  duties  almost  ten  years  ago.  I  take  it  that  the  idea  to  do  so  has 
been  influenced  by  the  fact  that,  thirteen  years  ago  and  exactly  in  the  same  place, 

I  had  terminated  the  1968  TIP  meeting  by  summarizing  it. 


Looking  up  the  Proceedings  of  the  1968  meeting  I  found  that,  by  whatever  circumstances, 
this  summary  had  not  been  included  and  kept  there  for  eternity's  sake,  yet  I  remember 
very  well  my  statement  of  that  time.  I  expressed  some  concern  about  the  fact  that  the 
papers  presented  and  discussed  had  much  to  do  with  the  technical  aspects  and  organisation 
of  information  services  and  very  little  with  the  situation  of  the  user,  even  though  the 
meeting  was  supposed  to  amount  to  a  "User  and  Supplier  Dialogue  on  Storage  and  Retrieval 
of  Information".  As  usual,  the  suppliers'  problems  had  dominated  the  meeting  -  much  to 
my  regret  -  and  I  said  so.  The  end  of  this  year's  meeting  might  teach  us  how  far  the 
scene  has  changed  in  this  respect. 


First  of  all  I  would  like  to  thank  AGARD  TIP  for  the  invitation,  which,  of  course, 
proved  to  be  quite  irresistible  to  me,  not  only  for  the  sheer  flattery  it  meant,  but 
because  it  offered  once  more  the  opportunity  to  be  with  this  group.  My  own  former 
longstanding  membership  in  AGARD  TIP  me  amt  so  much  to  me  in  retrospect  that  you  will, 
I  hope,  permit  me  a  few  remarks  on  this  relationship. 


It  has  been  almost  exactly  25  years  since  I  accompanied  Erich  Pletsch  to  Brussels  to 
attend  my  first  meeting  of  AGARD.  Erich  Pletsch  must  be  remembered  as  an  outstanding 
personality  in  our  professional  field.  He  was  at  that  time  not  only  the  director  of 
the  world-famous  Gmelin  Institute  for  Inorganic  Chemistry  but  as  well  later  on  the  most 
resolute  promoter  of  the  ideas  of  Alvin  Weinberg  in  this  country.  The  TIP  meeting  in 
1956  was  directed  by  Gene  Jackson  of  the  U.  S.  We  newcomers  were  at  once  warmly  accepted  ' 
and  the  unusually  cooperative  atmosphere  of  the  Panel  struck  me  at  once.  It  didn't 
merely  express  a  certain  mood:  the  substantial  practical  assistance  given  to  the  newly 
emerging  German  centre  for  aeronautical  information  proved  this  at  once.  Especially 
grateful  we  had  to  be  to  our  British  colleagues,  Messrs  Holloway,  Haylor  and  Vessey. 

These  colleagues  and  others  of  the  Panel  Members  became  in  due  time  real  personal 
friends  to  me.  I  would  mention  just  two  names  for  all  of  them:  Jan  Schuller  from  the 
Netherlands  and  Jean  Klopp  from  France.  Unforgettable,  too,  will  always  be 
Col  Woodrow  Dunlop,  in  his  time  head  of  ASTIA,  the  Information  agency  of  the  U.S.  Armed 
Forces.  His  most  courageous  approach  in  tackling  the  problems  of  storage  and  retrieval 
at  that  time  has  been  for  the  rest  of  us  a  substantial  benefit,  as  we  learned  much  from 
ASTIA' s  experience.  I  believe,  however,  that  his  authorities  at  that  time  were  not 
half  as  grateful  to  him  for  his  courage  of  sticking  his  neck  so  far  out  as  we  quite 
honestly  were. 


During  my  better  than  fifteen  years  of  membership  of  AGARD  TIP  I  had  much  opportunity 
to  compare  its  performance  to  that  of  many  other  international  organisations.  In  my 
experience  TIP  was  always  one  of  the  best  organisations  because  of  its  smooth  and 
efficient  cooperation.  May  this  friendly  spirit  in  the  Panel  go  on  for  many  years 
still  to  come! 


Now  to  my  topic  of  today:  I  shall  try  and  look  at  the  changing  information  scene,  with 
emphasis  on  storage  and  retrieval,  since  1968.  At  that  time  Alvin  Weinberg's  appeal 
to  his  government  for  paying  much  greater  attention  to  the  potential  value  of  information 
for  the  modern  society  and  its  development  had  borne  fruit.  He  had  postulated  much 
higher  efforts  for  the  careful  use  and  evaluation  of  the  worldwide  recorded  scientific 
and  technical  experience.  He  not  only  succeeded  with  his  own  government.  He  found 
serious  attention,  too,  within  the  entire  industrial  world.  Generally,  one  became 
aware  that  this  recorded  experience  presented  a  treasure  and  an  inexhaustible  resource 
which  had  to  be  handled  as  carefully  as  any  other  vital  resource.  This  meant  that  it  had 
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to  ne  made  accessible  as  widely  as  possible.  The  governments  in  other  countries  had 
learnt  their  lesson,  too,  even  though  they  had  to  foot  the  bills  for  the  ensuing 
developments.  As  far  as  suppliers  were  concerned,  we  seemed  to  be  in  1968  right  at 
the  beginning  of  a  fast-spreading  boom  of  information  activities. 


The  situation  was  indeed  challenging.  Computers  offered  new  and  promising  opportunities 
for  storage  and  retrieval.  Only  a  few  information  systems  were  at  that  time  already 
fully  operational  and,  if  I  remember  correctly,  all  of  them  worked  in  batch-processing. 
Interconnections  between  operating  systems  were  non-existent.  In  general,  most  of  the 
operational  steps  for  storage  and  retrieval  were  still  in  a  testing  stage  and  were  much 
debated.  The  hardware  in  use  was  not  only  widely  varied  but  in  a  period  of  steep 
development  to  ever-higher  capacity  and  performances.  This  demanded  a  continuous 
adaptation  of  the  programming  for  the  suppliers. 


The  basis  for  any  successful  retrieval  was  the  quality  of  the  input.  How  to  achieve 
dependable  recall  and  sufficient  precision  was  at  that  time  much  discussed.  How  to 
handle  less  familiar  languages,  especially  Eastern  ones,  started  to  worry  the  suppliers 
as  a  serious  challenge.  However,  the  main  efforts  concentrated  on  mastering  all  the 
techniques  for  assuring  satisfactory  output. 


The  social,  legal,  and  economic  consequences  arising  with  and  from  the  operation  of 
the  new  Information  systems  had  not  yet  come  to  play  a  major  role  during  discussions  at 
that  time. 


Only  part  of  the  user  potential  had  already  access  to  the  emerging  new  systems.  The 
vast  majority  still  relied  on  their  traditional  means  for  getting  access  to  recorded 
data  on  literature  and  facts.  They  browsed  secondary  publications  and  their  indexes, 
checked  tables  of  contents,  and  looked  up  handbooks  and  printed  tables.  In  many  fields 
they  could  rely  for  their  guidance  on  progress  reports  of  often  remarkably  high  standard, 
which  offered  an  extremely  valuable  help  for  singling  out  works  of  quality  and  were  also 
a  very  useful  tool  for  maintaining  current  awareness  of  their  readers.  Where  are  they 
left  today?  All  this  had  a  high  human  immediateness  but  -  alas!  -  these  do-it-yourself 
methods  became  more  and  more  time-consuming  and  less  and  less  satisfactory  with  the 
steadily  growing  quantities  of  new  literature  published. 


Small  wonder  that  the  users  started  looking  forward  hopefully  to  the  promised  land  of 
a  new  information  technology.  Its  prophets  did  much  for  stimulating  high  hopes,  possibly 
even  too  high.  I  often  had  the  feeling  at  that  time  that  in  the  heat  of  the  discussions 
the  term  "information"  was  too  often  (and  in  gross  simplification)  used  for  the  output 
of  the  new  systems,  even  though  this  output  could  not  offer  more  than  bibliographic 
data  on  available  literature.  This  amounted  to  suggesting  to  the  user  that  the  new 
systems  would  be  capable  of  relieving  him  fully  of  any  further  use  of  his  brain.  The 
new  systems  seemed  to  be  able  to  spit  out  on  demand  and  in  no  time  all  the  relevant 
answers  to  the  users'  problems,  ready  for  instant  use.  How  misleading  this  concept  was 
the  users  would  soon  come  to  experience. 


For  me,  and  in  our  context,  the  term  "information"  should  cover  that  process  which  goes 
on  inside  the  brain  of  the  user  when  he  is  confronting  his  professional  problem  with  the 
experience  of  other  people,  as  recorded  in  whatsoever  publication.  At  their  very  best 
good  Information  systems  can  only  support  that  process  by  providing  data  on  relevant 
recorded  experience,  be  it  on  facts,  be  it  on  literature.  This  contribution  is  today 
certainly  indispensable  in  view  of  the  quantity  of  reporting,  but  even  so  it  can  never 
be  more  than  a  preliminary  step  for  the  subsequent  intellectual  process.  You  know  the 
old  saying:  "Beauty  is  in  the  eye  of  the  beholder"?  In  a  very  similar  way  one  could 
state:  "Information  is  in  the  brain  of  the  reader"! 


Those  using  at  that  time  the  emerging  new  services  soon  came  to  recognize  that  the  quality 
of  the  answers  received  from  the  systems  depended  mainly  upon  the  quality  of  their  own 
requests,  -  that  is,  the  honest  ones  admitted  this.  For  others,  any  less  satisfactory 
results  derived  solely  from  the  deficiencies  of  the  new  systems.  Not  all  of  the  users 
were  ready  for  the  Inevitable  changing  of  their  habits,  when  compiling  their  requirements. 


In  1968  the  ancient  tradition  of  the  libraries  of  giving  free  service  from  their  holdings 
prevailed  as  well  in  the  new  information  services.  At  its  worst  the  users  had  to  pay 
token  charges. 


I  remember  1968  to  be  a  time  full  of  exploratory  initiatives  and  high  expectations 
on  the  side  of  the  suppliers  and  full  of  suspense  and  hopes  on  the  users'  side. 
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Details  of  the  present  situation  on  the  suppliers'  side  are  much  better  known  by  this 
audience  than  by  myself.  The  excellent  contributions  to  be  expected  from  Mr  Ember, 
Monsieur  Yanez  and  Mrs  Lamvik  are  coming  closest  to  depicting  the  present  situation 
in  its  full  complexity  and  from  many  more  angles  than  it  is  possible  for  me  to  consider 
in  this  introductory  contribution.  I  think  we  can  agree  that  the  most  significant 
change  since  1968  is  the  fact  that,  in  the  meantime,  millions  and  millions  of  data  on 
literature  and  facts  on  just  about  every  field  of  human  interest  and  activity  have  been 
stored  and  will  be  added  to  by  millions  of  data  year  by  year.  The  methods  of  retrieval 
are  no  longer  a  crucial  problem.  Batch  processing  has  been  replaced  by  on-line 
processing.  On-line  dialogue  via  terminal  has  become  a  standard  procedure  and  modern 
communications-technology  is  providing  means  of  direct  access  to  almost  all  important 
data-bases  and  of  interconnecting  data-bases  and  users  within  large  networks. 


Not  only  has  the  quantity  of  stored  data  grown.  So  has  the  number  of  offered  services 
and  institutions.  And  the  so-called  "non-technical  factors"  are  influencing  the 
operations  of  the  information  systems  extraordinarily.  All  these  influences  are  carefully 
analyzed  in  the  Paper  of  Monsieur  Yanez,  starting  with  the  more  operational  ones  and 
stepping  over  to  the  political  and  economic  factors  and  the  manifold  legal  considerations, 
without  leaving  out  the  thorny  linguistic  problems.  Further  constraints  are  added  by 
the  various  communication-processes.  In  their  cumulation  these  influences  can  build  up 
high  obstacles  for  the  end-user.  Any  user  looking  for  pertinent  results  on  his  requests, 
especially  on  multi-subject,  problem-oriented  ones,  might  feel  himself  placed  right  in 
the  middle  of  an  almost  impenetrable  thicket  of  offered  services  with  differing  rules 
and  regulations,  and  with  unknown  overlaps  and  gaps  between  them.  He  can  find  the  best 
way  through  this  thicket  only  if  and  when  he  has  the  time  and  the  will  to  become  familiar 
with  the  complexity  of  the  conditions  which  he  is  facing. 


This  is  but  one  aspect  of  the  changes  on  the  users'  side  since  1966.  Doubtless  the 
number  of  users  of  information  systems  has  multiplied.  Those  using  these  facilities 
regularly  have  adapted  themselves  much  better  to  the  conditions  of  their  use.  It  would 
be  very  interesting  to  know  at  least  roughly  what  proportion  of  the  user-potential  is 
using  the  information  systems  regularly  and  the  existing  differences  from  country  to 
country. 


However,  I  believe  that  in  the  most  Important  respect  the  user  has  not  changed  at  all. 

He  has  remained  basically  what  he  was  in  1968:  a  single  human  being,  eager  to  solve 
his  professional  problems  and  eager  to  support  this  effort  by  getting  himself  informed 
as  well  as  possible,  but  -  alas!  -  with  the  same  limited  personal  capacity  and  limited 
time  for  digesting  and  evaluating  informative  data  and  texts.  In  view  of  the  tremendous 
development  on  the  suppliers ’  side  in  the  quantity  and  diversity  of  services  and  holdings 
he  is  much  worse  off  than  ever.  Even  those  conscious  of  all  the  advantages  of  being 
fully  Informed:  can  they  afford  to  devote  an  ever  higher  proportion  of  their  professional 
time  to  singling  out  and  digesting  what  would  be  helpful  for  them?  I  am  afraid  the 
structure  of  the  offered  services  is  too  complex.  Rarely  is  one  single  supplier  in  a 
position  to  answer  exhaustively  any  problem-oriented  question.  Yet,  even  incomplete 
services  are  asking  for  money  and  the  user  has  to  convince  his  boss  that  this  money  will 
be  well  spent. 


This  rather  glum  situation  for  the  user  demands  a  way  out  and  there  seems  to  exist  a 
helpful  solution:  enlisting  the  services  of  an  intermediary,  an  information-broker. 

By  so  doing  the  user  himself  can  concentrate  on  formulating  his  requirements.  His 
partner  will  -  in  the  ideal  case  -  unburden  him  from  the  choice  of  the  most  suitable 
information  services,  from  scanning  the  results  of  the  retrieval,  extracting  the  needed 
data,  from  supplying  the  texts  the  user  might  wish  to  read  and,  if  need  should  be,  from 
supplying  the  necessary  translations. 


In  larger  industrial  firms,  research  institutions,  and  administrative  units  users  can 
almost  always  rely  upon  this  kind  of  assistance  from  their  colleagues  in  the  in-house 
libraries  or  information  departments.  These  helpers  not  only  understand  the  problems 
and  needs  of  their  user-group  but  have  also  become  very  knowledgeable  on  the  different 
Information  services  and  their  specific  properties.  It  seems  to  be  this  group  of 
users  which  has  Installed  terminals  and  direct  access  to  existing  networks.  But  only 
some  of  the  users  are  in  thlB  privileged  position.  Those  in  smaller  and  medium 
organisations,  not  to  speak  of  individual  users,  are  in  a  less  favourable  position. 

There  seems  to  be  a  trend,  which  is  to  close  this  gap:  professional  societies,  special 
libraries,  and  as  well  chambers  of  commerce,  are  offering  their  own  particular  services 
and  the  demand  seems  to  be  already  sufficiently  well  developed  for  free-lance  information- 
brokers  to  enter  this  field  too. 
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I  might  be  mistaken  but  I  think  that  this  development  is  very  promising.  It  relieves 
the  end-user  from  the  task  of  having  to  become  familiar  with  the  often  not-so- 
transparent  structure  of  the  offered  services.  He  can  concentrate  on  his  real  task 
without  having  to  forego  all  the  stimulation  the  confrontation  with  recorded 
experiences  can  mean  to  him. 


There  are  a  few  more  aspects  within  the  changing  scene  since  1968  I  would  like  to 
turn  to.  Alvin  Weinberg,  in  his  historical  report,  set  much  store  on  the  quality 
aspect  and,  quite  logically,  exhorted  authors  to  maintain  as  high  a  standard  as  possible 
in  their  presentations.  I  think  all  of  us  are  familiar  with  the  strange  situation, 
that  those  users  requiring  information  are  much  more  demanding  than  they  used  to  be, 
when  they  are  authors  themselves.  In  retrospect,  it  can  certainly  be  stated  that 
Weinberg's  postulates  were  absolutely  justified  but  that  the  majority  of  authors  did 
not  act  accordingly.  The  consequence  is  that  this  slightly  malicious  remark  of 
Gene  Garfield  for  characterising  the  new  information  systems  seems  still  to  be  valid: 
"garbage  in,  garbage  out!".  I  am  afraid  there  is  no  easy  way  out.  Preselection 
before  input  means  an  invitation  to  unlimited  discrimination.  Consequently,  the  data 
bases  are  full  of  data  on  less  valuable  literature.  Unfortunately,  the  costs  of 
storing  these  data  don't  differ  from  those  of  storing  data  on  valuable  records.  The 
next  consequence  is  that,  at  the  user's  request,  data  will  be  furnished  on  relevant 
but  worthless  publications.  Weinberg  had  recommended  the  setting  up  of  special 
analysis  centres.  They  were  to  have  been  manned  by  highly  knowledgeable  specialists 
able  to  exercise  their  judgement  so  as  to  single  out  the  best  portions  of  the 
publications.  I  don't  know  whether  or  not  any  such  institution  has  been  created 
as  a  public  service. 


With  regard  to  the  quality  aspect  it  is  perhaps  worth  mentioning  that  in  very  many 
cases  the  user  is  no  longer  able  to  judge  the  quality  and  validity  of  the  service  of 
the  supplier.  The  only  thing  he  can  be  sure  of  is  that  he  has  to  pay  for  the  services 
received,  -  quite  Independent  of  the  true  value  of  the  service  received.  This  is  a 
strange  situation  which  could  be  tolerable  only  insofar  as  the  supplier  is  fully 
conscious  of  his  responsibility,  that  is,  when  he  accepts  the  position  of  trustee 
via-a-vis  the  user.  However,  is  this  position  as  a  trustee  compatible  with  the 
financial  constraints  of  any  supplier  who  has  either  to  make  money  or  at  least  to 
try  to  break  even?  It  would  be  so  easy  to  save  on  the  back  of  the  paying  user,  when 
one  can  hardly  get  caught  red-handed.  I  am  far  from  saying  that  this  actually  happens 
but  is  it  to  be  fully  excluded? 


In  this  same  context,  every  once  in  a  while  nowadays  the  term  "information-market"  is 
mentioned,  maybe  Intentionally,  for  describing  the  supplier-user  relationship.  This 
would,  of  course,  suggest  that  there  exists  a  market  with  rules  of  supply  and  demand 
and  with  a  healthy  competition  between  the  suppliers  as  a  regulatory  element.  I  am 
afraid  that  such  a  market  does  not  exist  in  reality.  On  the  suppliers’  side  we  are 
facing,  for  the  most  important  subject  fields,  de-facto  monopolies.  Nobody  can  afford 
to  double  up  INIS,  nor  INSPEC,  nor  CAS,  nor  any  other  of  the  outstanding  services,  and 
that  only  for  the  sake  of  competition.  The  financial  and  intellectual  effort  would  be 
too  big  and  as  well  too  risky.  So  we  must  wonder:  where  are  then  the  regulatory 
forces  of  the  market  which  are  supposed  to  protect  the  users'  Interest? 


Parallel  to  this  subject  monopoly,  which  is  as  significant  to  me  as  it  is  inevitable, 
we  are  facing  as  well  a  kind  of  linguistic  monopoly,  again  an  inevitable  one,  but  with 
consequences  for  important  user-groups.  It  is  a  fact,  just  a  part  of  the  history  of 
our  profession,  that  most  of  the  operational  databases  originated  in  English-speaking 
countries.  This  is  quite  understandable  because  here  was  located  the  highest  user 
potential.  Even  though  we  have  to  accept  this  fact  it  is  nevertheless  creating  problems 
for  the  supply  to  the  non-English  world  and  its  users.  Furthermore,  I  am  afraid  that 
this  situation  might  lead  -  again  quite  understandably  -  to  a  relative  under¬ 
representation  of  the  literature  in  languages  other  than  English.  Doubtless  this  would 
mean  a  discriminating  effect,  even  though  it  might  happen  quite  unintentionally,  if 
not  dictated  by  consideration  of  the  much  higher  cost  for  the  input  operation.  I 
feel  it  is  regrettable  that  there  are  not  more  databases  organized  like  INIS  or  AGRIS, 
where  equal  participation  is  the  basis  of  all  operations.  But  it  is  certainly  much 
too  late  to  expect  any  further  developments  of  this  kind. 


From  these  considerations  it  is  only  one  step  to  the  fact  that  national  Independence  in 
access  to  recorded  scientific  and  technical  data  simply  does  not  exist  for  a  large 
number  of  industrialized  countries.  The  artificial  creation  of  national  networks  does 
not  change  this  situation.  Moreover,  these  national  networks  do  already,  or  will  in 
the  near  future,  compete  with  international  commercial  networks.  Coming  back  once 
morr  to  the  idea  of  Alvin  Weinberg:  he  likened  information  to  other  essential  and 
ind,  jen sable  .ources  -  and  rightly  so.  But  this  might  mean  that  during  the  Eighties 

we  w.  ome  ,  face  a  similar  situation  in  our  field  as  already  exists  with  other 

resour  ,  Iiku  energy,  raw  materials,  etc. 


What  this  could  mean  in  the  long  run  remains  to  be  experienced.  At  worst  it  could  mean 
stepping  away  from  the  centuries-old  .radition  of  free  exchange  of  ideas,  theories, 
and  experiences  in  science  and  knowledge.  Together  with  the  trend  to  commercialisation 
of  information  services  it  could  very  well  mean  that  even  something  as  sensitive  as 
information  handling  might  become  more  and  more  subject  to  all  the  constraints  of 
commercial  survival.  This  would  put  all  the  legitimate  expectations  of  the  users 
at  the  mercy  of  forces  outside  of  their  interests. 


Unfortunately,  users  do  not  constitute  a  pressure  group.  Unlike  customers  in  any  normal 
trade  they  cannot  even  use  the  weapons  of  refusal.  Is  there  any  counterbalance  to 
these  rather  adverse  possibilities?  Personally,  I  believe  that,  so  long  as  we  are 
willing  to  heed  the  ideas  of  Alvin  Weinberg  and  to  maintain  the  outstanding  place  he 
gave  to  the  importance  of  this  world-wide  treasure  of  recorded  experience,  as  long  as 
we  continue  to  think  as  he  did,  we  cannot  refuse  nor  play  down  the  corresponding  serious 
responsibilities  on  the  suppliers'  side.  It  might  very  well  be  that  in  the  Eighties  many 
suppliers  will  become  torn  between  this  professional  responsibility  and  growing  financial 
or  commercial  pressures.  This  problem  might  become  a  steady  companion  to  any  supplier. 


As  regards  the  users,  I  do  hope  that  in  the  Eighties  the  manifold  attempts  at  giving 
higher  transparency  to  the  nature  and  content  of  the  offered  services  might  succeed  - 
together  with  the  hopefully  started  efforts  towards  a  broader  standardisation  of  the 
rules  and  regulations  for  the  use  of  the  offered  services.  Furthermore,  I  am  hoping 
that  fair  international  cooperation  can  go  on  and  does  not  become  victim  of  commercial 
competition.  I  don't  think  that  I  have  hidden  my  own  old-fashioned  concept,  that 
information  handling  is  and  should  remain  a  task  as  indispensable  as  it  should  be 
conducted  responsibly.  Therefore  I  consider  it  a  danger  to  act  in  this  field  in  the 
same  way  as  if  one  were  trading  in  any  whatsoever  commodity.  I  think  all  of  us  need  to 
do  our  best  to  try  to  develop  the  art  of  serving  our  users  in  the  best  possible  manner, 
if  ^nly  in  the  interest  of  contributing  to  the  survival  of  our  society. 
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Since  the  last  TIP  Specialists'  meeting  in  1968  technological  progress  dramatically 
changed  the  setting  and  the  capabilities  of  information  services,  '''he  routine  patterns  ol 
data  storage  and  retrieval  manipulations  have  been  fundamentally  restructured  and  the 
interface  between  system  and  user  is  now  bridged  over  by  powerful  linking  mechanisms. 

Most  of  the  major  developments  of  information  services  in  the  Intervening  period  can  be 
traced  to 

1.  advances  in  computer  design  and  automation, 

2.  improvements  in  telecommunication  and 

3.  the  union  of  computer  and  telecommunication  technologies. 

Computers  increased  their  storage  capacities,  core  memory  potential,  processing  speed, 
while  their  size,  cost  and  operating  expenses  dropped  significantly.  Telecommunication 
Introduced  new  modes  of  digital  data  transfer,  whose  cost  is  less  dependent  on  geographic 
distance,  and  is  now  developing  inter-city  and  inter-continental  connections  via 
inexpensive  fibre  material.  The  joint  power  of  computers  and  telecommunication  maintains 
various  modes  of  on-line  services,  links  remote  processing  hardwares,  supports  networks 
and  provides  global  dimensions  for  the  information  Industry. 

The  consequences  of  these  developments  dominate  the  modern  service  environment  where  the 
user  assumed  a  commard  position  over  electronic  machinery  and  telecommunication 
facilities.  Perhaps  the  most  notable  character  of  modern  information  services  is  the 
user' 8  participatory  role  and  his  changed  attitude  toward  the  information-gathering 
activity  which  was  once  time-consuming  and  cumbersome  and  became  easy  and  convenient. 
Either  directly,  or  through  an  Intermediary,  the  user  can  now  plug  into  an  international 
electronic  marketplace  where  information,  as  a  commodity,  is  in  abundant  supply  and 
readily  available  for  his  individual  use. 

It  is  also  of  importance  that  the  increasing  complexity  of  services  are  usually  not 
accompanied  by  an  Increase  of  sophistication  in  using  them.  The  rule  almost  generally 
applies  that  the  more  complex  a  system  is,  the  easier  its  user-interface  can  be  managed. 
Signing-on  procedures,  command  languages,  file  interrogation  techniques  and  various  other 
manipulations  can  normally  be  learnt  in  a  few  training  sessions. 

The  large  variety  in  the  service  capabilities  of  modern  information  systems  compels  me  to 
select  carefully  only  a  few  characteristics  and  features  which  qualify  as  "major 
developments".  I  have  chosen  the  following  four  properties  as  the  primary  potentials  of 
modern  information  services: 

1.  Searchabiiity  of  bibliographic  records  of  the  world's  scientific  and  technical 
literature. 

2.  Access  to  critically  evaluated  numeric  data  and  research  results. 

3.  Ability  to  bring  together  people  to  solve  problems  and  jointly  develop  ideas. 

A.  Means  to  locate  source  literature  in  full  length  and  reliable  document  delivery. 

While  reviewing  briefly  these  properties,  I  shall  refer  to  the  Canadian  experience  in 
scientific  and  technical  Information  services  only  to  illustrate  the  discussed  operational 
routines. 

1.  Searchabiiity  of  bibliographic  records  of  the  world's 
scientific  and  technical  literature 


According  to  the  1981  Asllb  directory  of  "Online  Bibliographic  Databases"  by  Hall  and 
Brown,  there  are  now  some  70  million  bibliographic  references  searchable  via  computer  and 
over  851  of  these  are  in  the  natural  sciences  and  technology. 

Data  files  covering  entire  disciplinary  fields  (chemistry,  health  sciences,  etc.)  or 
mission-oriented  research  areas  (oil  sands  technology,  occupational  safety),  specific 
types  of  publications  (technical  reports,  patents),  represent  the  main  body  of  data  which 
have  been  made  searchable  in  the  on-line  and  off-line  modes.  Data  bases  which  are 
commercially  accessible  through  the  vendors  of  the  information  industry  display  the 
literature  of  the  last  10-15  years.  This  is  sufficient  to  cover  the  time  span  of  the 
half-life  of  current  scientific  and  technical  knowledge. 

Beyond  the  publicly  available  stock  of  retrospective  files,  there  is  a  very  large  number 
of  private  data  bases  of  research  workers  and  institutions.  With  the  exception  of  the 


2-2 


proprietary  files  of  industrial  research  establishments,  some  of  these  private  data  bases 
are  often,  or  become  eventually,  available  to  a  professional  community  or  a  national 
clientele . 

The  most  popular  international  data  banks  are  electronically  stored  cumulated  versions  of 
printed  indexes  and  abstract  journals.  They  can  be  scanned  by  on-line  searching  or  used 
to  update  individuals  recurrently  through  the  selective  dissemination  of  personalized  sets 
of  citations. 

In  Canada  bibliographic  search  services  are  offered  by  the  private  and  public  sectors  of 
our  own  information  industry  and  by  international  vendors.  One  of  our  on-line 
bibliographic  search  system  is  CAM/ OLE  (Canadian  On-Line  Enquiry)  which  is  operated  by  the 
Canada  Institute  for  Scientific  and  Technical  Information  (CISTI),  a  division  of  the 
National  Research  Council  of  Canada.  CAN/OLE,  which  has  been  in  existence  since  1974,  now 
offers  18  files  for  searching  via  some  700  terminals  across  the  country.  The  files 
contain  about  12  million  citations  and  the  system  was  utilized  for  over  6000  connect  hours 
in  the  past  year.  The  Canadian  MEDLINE  network  operates  through  110  centres  at  the 
present  time  and  supports  17  files.  These  files  are  not  mounted  in  Canada  but  can  be 
accessed  via  direct  hook-up  with  the  MEDLARS  facility  of  the  U.S.  National  Library  of 
Medicine  in  Washington.  The  updating  portions  of  the  CAN/OLE  files  and  the  monthly 
supplements  to  the  MEDLINE  data  bases  are  also  searched  in  batch  mode  by  CISTI 's  Selective 
Dissemination  of  Information  service  (CAN/SDI)  which  has  been  in  operation  since  1969. 
CAN/SDI  now  serves  roughly  2200  subscribers  and  scans  over  3000  Individual  interest 
profiles  for  its  clientele. 

In  Canada,  as  in  any  industrialized  country,  bibliographic  search  services  are 
instrumental  in  the  promotion  of  economic  and  social  development.  Canada  produces  only 
about  3-4X  of  the  world's  scientific  and  technical  literature  and  is  a  "net  importer"  of 
information.  With  a  very  few  exceptions  all  countries  belong  to  the  same  category  and 
their  science  and  technology  must  tap  the  pooled  worldwide  resources  of  published 
knowledge.  It  is  certainly  one  of  the  major  developments  of  modern  information  services 
that  now  they  are  able  to  accomplish  this  essential  task. 

2 .  Access  to  critically  evaluated  numeric  data  and  research  results 

Scientific  research  is  a  prolific  generator  and  constant  user  of  measurements.  The  basic 
and  applied  scientist  or  the  engineer  must  often  consult  elaborate  tables  or  graphic 
representations  of  quantities  and  values,  handbooks,  laboratory  manuals  and  the  serial 
literature  to  find  facts  in  the  form  of  analyzed  and  evaluated  numeric  data.  A  relatively 
recent  development  is  the  compilation  of  numeric  data  in  data  bases  which  can  serve 
Investigators  as  research  tools  capable  of  two  essential  functions. 

Mounted  on  computers,  numeric  data  bases  not  only  offer  "look-up  information"  by  locating 
and  retrieving  items  of  interest  from  the  content  of  regularly  updated  files.  The 
selected  set  of  data  can  also  be  manipulated  at  will  in  a  wide  range  of  theoretical  and 
experimental  applications.  They  have  a  high  degree  of  maneuverability  and  graphic 
capabilities  further  support  their  versatile  use. 

It  is  expensive  and  very  laborious  to  create  and  maintain  numeric  files  and  rigorously 
assure  their  utmost  accuracy.  Despite  their  relative  scarcity  at  tne  present  time,  files 
are  being  developed  in  growing  numbers  in  the  United  States,  United  Kingdom,  federal 
Republic  of  Germany,  Trance,  Japan  and  elsewhere. 

To  exemplify  the  complexity  of  such  a  file,  let  me  point  to  one  data  base  that  originates 
in  the  Crystallographic  Data  Center,  Cambridge,  England,  and  contains  the  results  of  some 
28,000  independent  studies  of  25,600  chemical  compounds.  This  file  grows  at  the  average 
rate  of  3000  new  entries  annually.  The  three  linked  sub-files  of  the  Cambridge 
Crystallographic  Data  Base  contain  bibliographic  data  (name,  formula  and  class  of  the 
compound  and  literature  reference),  connectivity  data  (chemical  connectivity  of  atoms, 
bond  types,  etc.)  and,  finally,  comprehensive  data  on  the  properties  of  the  compound 
(unit  cell,  symmetry,  atomic  co-ordinates,  bond  length,  etc.). 

So  far  5  Canadian  numeric  data  bases  have  been  developed  in  the  fields  of  chemical 
engineering,  cuemlcal  thermodynamics,  ttiermochemistry ,  electrical  power  engineering  and 
solid  engineering. 

Since  1978  CISTI  has  made  accessible  in  Canada  the  SPIR  (Search  Program  for  Infrared 
Spectra)  data  base  with  some  143,000  entries.  The  system  enables  the  user  to  locate  the 
spectrum  closest  to  his  unxnown.  CRYSTOR,  the  Cambridge  Crystallographic  Data  Base,  which 
I  mentioned  earlier,  became  a  national  service  last  summer  and  is  now  in  full  operation  in 
the  on-line  mode. 

The  availability  of  manipulative  numeric  data  files,  with  their  specially  designed 
software,  should  be  considered  another  major  development  whose  impact  is  already  visible. 
It  is  expected  that  in  the  coming  years  this  variant  of  modern  information  services  will 
expand  its  sphere  of  applicability  and  become  one  of  the  common  tools  of  researchers  in 
scientific  and  technical  disciplines. 
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3.  Ability  to  bring  together  people  to  solve  problems 

and  jointly  develop  Ideas _ 

National  Inventories  of  ongoing  research  activities,  directories  of  projects  subsidized  by 
governmental  granting  agencies  and  private  foundations,  selective  listings  of  experts  in 
various  scientific  and  technical  specialties,  and  other  data  cumulations  of  this  kind, 
have  many  uses  in  policy  planning,  scientific  management,  in  monitoring  the  directional 
trends  of  scientific  communities,  etc.  They  also  fulfill  an  important  function  for  the 
working  scientist  and  technologist.  The  potential  benefits  of  this  function  (which  is 
usually  defined  as  "referral")  are  now  implemented  with  demonstrated  success. 

Scientists  often  need  intellectual  help  that  only  other  scientists  can  offer.  In  some 
cases  this  help  is  not  available  in  the  immediate  environment  of  the  investigator  and  the 
literature  does  not  contain  the  clues  which  can  lead  to  the  solution  of  his  problem. 
Science  and  technology  has  strong  communal  characteristics,  a  nationally  and 
internationally  scattered  population  that  is  stratified  and  contained  in  "invisible 
colleges"  and  various  types  of  insular  settings. 

There  is  a  class  of  information  which  can  be  found  only  in  the  laboratory,  close  to  the 
workbench  of  the  researcher.  Modern  information  services  can  tell  a  great  deal  to  their 
users  of  "who  is  doing  what  and  where",  "what  is  the  status  of  the  activities  of  others", 
and  "who  are  the  experts  of  an  investigative  topic".  The  basic  instruments  for  providing 
such  information  are: 

1.  research  inventories  (e.g.,  SSIE  in  the  USA,  IEC  in  Canada), 

2.  data  banks  for  the  "fugitive  literature"  (e.g.,  NTIS  for  technical  reports,  data 
bases  for  conference  presentations)  and 

3.  data  banks  on  expertise  (e.g.,  KSI  in  Canada). 

Research  inventories  in  machine -readable  form,  usually  with  hard  copy  or  microform 
versions,  are  descriptive  records  of  individual  projects  which  identify  principal 
investigators  and  co-workers  by  name  and  affiliation.  To  some  record  formats  a  string  of 
keywords  or  descriptors  from  a  controlled  vocabulary,  and  often  abstracts,  are  added  and 
also  locating  information  concerning  the  availability  of  progress  and  final  reports 
generated  by  the  project. 

Variations  in  scope  and  comprehensiveness  range  from  narrow  sectorial  compilations  (such 
as  Arctic  research)  to  an  all-inclusive,  retrospective  and  minutely  annotated  search 
system  which  is  maintained  by  the  Smithsonian  Science  Information  Exchange.  The  Canadian 
IEC  (Information  Exchange  Centre)  file  is  limited  to  federally  supported  academic  research 
projects  since  1970  with  an  annual  growth  of  about  10,000  entries. 

The  "fugitive  literature"  of  technical  reports  display  scientific  and  technical  activities 
in  the  close-up  dimensions  of  magnified  details.  Their  eluslveness  is  due  to  the  limited 
circulation  of  accounts  on  governmental  in-house  projects,  contracted  research  and 
activities  with  grant  support.  Access  to  U.S.  report  material  via  on-line  search,  and  in 
microfiche  and  hard  copy  form,  through  the  National  Technical  Information  Service  (NTIS) 
is  invaluable  for  the  working  scientist  and  engineer.  Other  searchable  sources  of  smaller 
proportions,  such  as  the  CODOC  data  base  of  the  CAN/OLE  system  in  Canada,  are  further 
examples  of  this  service.  Papers  presented  at  conferences  and  translation  of  research 
articles  from  foreign  languages,  which  also  belong  to  the  fugitive  literature,  have  become 
more  manageable  in  recent  years  by  IS I  in  the  U.S.  and  a  number  of  other  data  services. 

Data  banks  containing  records  of  subject  experts,  with  well-defined  indicators  of  their 
specialty,  are  often  not  made  public  because  this  service  must  be  carefully  protected  from 
abuse.  The  task  of  this  sensitive  utility  is  to  promote  direct  personal  contacts, 
consultative  links  and  partnership  of  people  and  all  the  related  functions  are  dependent 
on  the  consent  and  circumstances  of  individuals.  The  Knowledge  Source  Index  (KSI)  with 
the  detailed  and  continuously  updated  records  of  some  3500  Canadian  experts  in  government, 
universities  and  industry,  is  one  of  the  examples  of  this  information  servlce- 

Creating  awareness  of  the  working  interests,  accomplishments  and  the  know-how  of  members 
of  professional  communities  has  far-reaching  significance  in  the  advancement  of  scientific 
and  technical  knowedge  and  practice.  Effectively  used,  this  type  of  service  has  great 
potentials  in  the  team- format ion  and  personal  orientation  of  individuals  and,  therefore, 
deserves  the  rank  of  "major  information  services". 

4.  Means  to  locate  source  literature  in  full  length  and 

reliable  document  delivery _ 


The  70  million  bibliographic  citations  displayed  by  on-line  services  recommend  reading 
material  to  the  user  to  satisfy  his  information  need.  The  result  of  a  successful 
bibliographic  search  is,  therefore,  the  first  action  in  a  chain  of  events  which  leads  to 
the  acquisition  and  use  of  the  information.  The  three  further  steps,  following  the 
bibliographic  identification,  are: 

1.  location  of  the  cited  item, 

2.  requesting  the  material  from  a  source,  and 

3.  delivery  of  the  document. 
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Machine-readable  catalogues  of  monographic  and  serial  collections,  automated  union  lists, 
became  standard  sources  for  location  Information  in  the  past  decade.  Computer-produced 
microfiche  (COM)  versions  of  these  tools  and  conventional  hard-copy  volumes  are  also 
widely  used. 

Information  systems  often  support  the  location  search  by  including  source  data  in  the 
citation  format  or  by  mounting  catalogues  and  union  lists  on  the  same  computer  system 
which  provides  the  bibliographic  files  for  search.  After  completing  the  search 
manipulations ,  the  user  can  switch  to  a  national  holding  file  on  the  same  system  to  find 
immediately  a  source  for  the  photocopy  he  wants  to  order.  During  this  maneuvre  he  can 
find  out  whether  his  own  library  holds  the  required  item  or,  if  not,  which  is  the  holding 
library  closest  to  him. 

in  Canada  the  UTL.AS  (University  of  Toronto  Library  Automation  System)  maintains  a 
catalogue  of  the  major  academic  and  governmental  collections  in  searchable 
machine-readable  form.  A  Canadian  version  of  the  DOBIS  library  management  system  of 
Dortmund  University  in  the  Federal  Republic  of  Germany  was  jointly  developed  by  the 
National  Library  and  CIST1  for  shared  cataloguing  functions  among  its  fields  of 
application.  CISTI's  on-line  information  system,  CAN/OLE,  Includes  the  national  union 
list  or  scientific  and  technical  serials  in  a  set  of  18  data  bases,  along  with  detailed 
files  on  the  document  collections  of  two  important  governmental  libraries:  the  departments 
of  Transport  and  Environment. 

On-line  document  ordering  and  the  so-called  electronic  mailbox  facility  were  designed  to 
automate  the  document  requesting  phase.  This  development  took  place  in  the  past  5-6  years 
and  since  then  became  a  component  of  large  North  American  and  European  Information 
networks . 

CISTI  backs  up  the  CAN/ OLE,  MEDLINE  and  CAN/SDI  services  with  photocopies  from  its  serial 
collection  of  32,000  titles.  Via  on-line  ordering,  telex  and  mall,  184,000  requests  were 
sent  to  us  last  year.  Of  this  volume  of  requests  76Z  was  filled  from  CISTI's  own 
collection  and  another  201  were  forwarded  to  the  other  253  scientific  and  technical 
libraries  whose  holdings  are  included  in  the  union  list. 

Document  delivery,  the  final  step  in  the  chain  of  events,  is  still  carried  out  by 
conventionally  transporting  the  item  from  the  source  library  to  the  user.  While  holding 
information  can  be  obtained  by  automatic  means  and  the  ordering  routine  for  photocopies 
and  loans  has  been  enriched  with  automated  elements,  we  cannot  report  at  this  time  major 
developments  in  document  delivery.  Telefacsimile  transmission  of  full  text  has  not  been 
employed  on  the  large  scale  and  videotext  technology  is  still  a  promise  for  the  future. 

The  only  Canadian  innovation  in  this  area  is  the  bulk  delivery  of  items  from  CISTI  in 
Ottawa  to  distant  cities.  The  combination  of  courrier  service  and  air  transport  carries 
document  packages  to  drop-off  points  from  where  they  are  locally  distributed  to  their 
final  destination.  Considering  that  about  SOX  of  all  the  requests  filled  by  CISTI  have  a 
24-hour  internal  turn-around  time,  and  bulk  delivery  speeded  up  considerably  the 
transportation  procedures,  cue  time  frame  of  this  transaction  was  reduced  to  the  minimum 
under  the  present  circumstances. 


* 

I  have  attempted  to  review  the  major  developments  in  modern  information  services  with 
examples  taken  from  the  Canadian  experience.  The  review  1  have  presented  was  very 
selective  and  focussed  on  the  user-related  operational  nature  of  services,  leaving 
essential  related  topics,  technical  Issues,  managerial  and  economic  problem  areas, 
projections  for  the  future,  to  other  presentations  in  thr  program. 


We  live  in  an  age  of  constant  developments  and  our  professional  environment,  our  tools  and 
practices,  are  always  transitional.  Whenever  we  meet  there  are  new  major  developments  to 
talk  about  -  and  this  is  the  most  stimulating  aspect  of  our  profession. 
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Summary : 

The  driving  force  behind  the  growth  of  data  processing  up  till  now  has  been  the  explo¬ 
sive  development  of  technology.  The  availability  of  semiconductor  chips  with  an  ever- 
increasing  component  density  has  made  a  continuous  improvement  in  the  price-performance 
ratio  of  computer  systems  possible. 

During  the  seventies  the  mainframe  power  of  an  average  dp-installation  increased  at  least 
tenfold,  the  main  memory  size  by  a  factor  of  30  and  disk  memory  by  a  factor  of  100.  Over 
the  same  period  floor  space,  power  consumption  and  costs  decreased. 

The  utilization  of  dp-systems  was  improved  by  innovations  in  systems  architecture  and 
software.  Typical  examples  of  this  are  virtual  addressing,  microprogramming,  improved 
protection  mechanisms  and  data  base  systems. 

The  use  of  telecommunications  in  computing  applications  has  expanded  rapidly.  Between 
1970  and  1980  the  number  of  terminals  in  use  in  the  Federal  Republic  of  Germany  increased 
by  a  factor  of  ^ 5 ,  similar  increases  are  also  common  in  other  countries.  During  the  same 
period  data  networks  for  public  use  appeared  in  all  major  western  countries. 

Due  to  differences  in  telecommunications  law,  data  networks  in  the  United  States  and 
Europe  evolved  in  different  ways.  Standardization  activities  in  this  field  will  form  a 
basis  for  generally  available  public  data  networks  in  the  near  future.  During  the 
eighties  low  cost  connections  to  data  networks  will  be  made  available  to  the  private 
citizen. 

At  the  end  of  the  sixties  information  storage  and  retrieval  systems  were  only  available 
in  a  rudimentary  form,  today  however  quite  sophisticated  real-time  information  systems 
are  employed.  As  a  result  of  the  progress  made  in  dialog-processing  and  access  methods 
for  non-specialized  users  information  systems  have  enjoyed  increasing  popularity. 

At  present  research  is  being  carried  out  on  natural  language  text  processing. 

In  the  future  character  recognition,  speech  processing,  intelligent  terminal  systems  and 
public  data  networks  will  be  the  center  of  interest. 

1.  Hardware  developments 

Three  3  key  factors  have  contributed  to  the  recent  development  of  large  CPUs 

Progress  in  semiconductor  technology  between  1970  and  1980  has  made  it  possible 
to  increase  the  number  of  logic  functions  implemented  on  a  silicon  chip  by  a  factor 
of  100a 

Semiconductor  memories  with  an  access  time  of  less  than  10  nsec  are  available. 

Techniques  which  permit  the  packing  of  a  large  number  of  silicon  chips  in  a 
very  small  volume  have  been  developed 

This  means  that  CPUs  capable  of  carrying  out  several  million  operations  per  second  can 
be  constructed  whereas  the  present  costs  are  only  a  fraction  of  the  costs  involved  ten 
years  ago. 

Since  1970  main  memories  have  generally  been  implemented  using  semiconductor  memory  and 
capacities  of  several  million  characters  are  quite  common.  Disk  memory,  with  capacities 
of  a  few  hundred  million  characters  per  disk  drive,  has  been  developed  and  used  as 
secondary  memory.  Attempts  to  find  a  memory  accessible  by  purely  electronic  methods,  e.g. 
bubble  memory,  and  so  replace  disk  memories  which  operate  on  an  electromechanical  basis, 
have  not  been  successful.  Development  work  on  magnetic  tape  memory  has  been  very  tardy 
and  apart  from  improving  the  recording  density  by  a  factor  of  4  no  further  innovations 
have  been  made. 

In  the  seventies  there  were  several  attempts  to  market  electro-mechanical  tape  library 
systems,  but  none  of  these  were  resoundingly  successful. 

A  landmark,  as  far  as  output  devices  are  concerned,  is  the  appearance  of  the  first  elec¬ 
tro-photographic  high  performance  printer  on  the  market.  These  devices  operate  using  a 
laser  printer  system.  Apart  from  a  very  high  printing  speed,  they  offer  character  sets 
which  can  be  exchanged  electronically  and  graphics.  The  matrix  printer  is  by  far  the  most 
popular  type  of  slow  printer  in  the  lower  price  range.  The  ink  jet  printer  is  another  type 
of  printer  which  has  been  developed  in  the  meantime.  It  operates  silently  and  many  are 
capable  of  producing  printouts  in  several  colours. 

The  microprocessor,  whose  capacity  and  complexity  is  continually  increasing  has  been  and 
will  continue  to  be  a  key  factor  in  the  development  of  hardware  for  terminals  and  small 
computers. 
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2 .  Architecture  and  software 


Looking  back  over  the  last  10  years  it  is  obvious  that  innovations  in  computer  architec¬ 
ture  are  just  as  important  as  the  technological  developments  that  have  taken  place. 

At  the  beginning  of  the  seventies  it  could  already  be  foreseen  that  the  increasing  com¬ 
plexity  of  data  processing  systems  would,  sooner  or  later,  cause  serious  reliability 
problems.  The  large  number  of  components  constituting  a  data  processing  installation  and 
the  certainty  that  this  number  would  continue  to  increase  was  a  reason  for  fearing  that 
random  faults  would  occur  so  frequently  that  the  time  between  breakdowns  would  only  be  a 
few  hours.  This  problem  has  however  been  solved  by  means  of  automatic  error  correcting 
procedures.  Today,  for  example,  errors  in  the  main  memory  are  usually  corrected  auto¬ 
matically. 

In  the  past  computer  commands  were  often  implemented  in  the  hardware  of  the  device,  now¬ 
adays  microprogramming  means  that  an  increasing  number  of  more  powerful  commands  can  be 
added  to  the  computer's  repertoire. 

Two  main  advantages  resulted  from  the  increase  in  main  memory  size  -  increasingly  larger 
and  more  complex  programs  coul  '  -.9  run  and  computer  time  could  be  shared  simultaneously 
by  a  greater  number  of  users.  This  meant  that  the  danger  of  an  error  in  one  program  caus¬ 
ing  other  programs  to  malfunction  became  more  acute.  Virtual  addressing  and  protection 
mechanisms  were  introduced  to  eliminate  these  problems.  In  this  way  the  access  of  indi¬ 
vidual  users  to  data  and  programs  could  be  checked. 

Multiprocessor  systems  were  created  when  the  demands  for  more  computer  power  could  no 
longer  be  satisfied  by  the  largest  processors.  This  involved  distributing  the  system 
load  among  several  processors  which  operate  together  in  a  well  defined  way.  The  main 
memory  of  the  system  was  also  replaced  by  a  memory  hierarchy  consisting  of  a  small  fast 
memory  and  a  very  large  slow  one.  These  changes  resulted  in  a  marked  increase  in  operat¬ 
ing  speed. 

In  the  early  days  of  data  processing  the  manufacturer  delivered  a  very  simple  operating 
system,  which  realized  a  few  basic  functions,  in  addition  to  the  hardware.  Since  then 
system  programs  with  several  million  commands  have  been  developed.  Today  many  of  the 
functions  the  user  requires  are  already  provided  by  the  operating  system  which  has  stan¬ 
dard  interfaces.  This  means  that  the  user  no  longer  has  to  spend  as  much  time  as  he  did 
before  writing  his  own  programs. 

At  first  it  was  feared  that  the  increasing  number  of  elements  in  operating  system  codes 
would  cause  an  increasing  program  error  rate  and  consequently  lower  stability  and  re¬ 
liability.  In  the  meantime  however  new  procedures  for  designing  and  implementing  soft¬ 
ware  systems  were  developed  in  a  new  branch  of  applied  science  called  software  engineer¬ 
ing.  The  result  of  these  new  procedures  was  increased  software  quality  and  performance. 
Great  advances  have  been  made  in  data  security  procedures  and  methods  of  restarting  the 
system  when  faults  occur. 

The  costs  involved  in  system  software  development  have,  in  the  course  of  time,  become 
considerably  larger  than  those  of  hardware  development.  For  this  reason  payment  in  ad¬ 
vance  for  system  software  is  a  condition  imposed  by  all  system  suppliers.  This  led  to 
the  creation  of  a  software  market  with  strong  competition  between  numerous  software 
houses  and  the  system  suppliers.  Today  the  user  can  buy  ready  made  software  packages  to 
solve  many  of  his  software  problems. 


3.  Telecommunications  and  Data  Networks 


Teleprocessing  has  existed  since  the  inception  of  data  processing  approximately  30  years 
ago.  The  main  function  of  teleprocessing  then  was  allowing  the  user  to  carry  out  input/ 
output  operations  at  a  location  remote  from  the  computer  centre.  Remote  batch  stations 
connected  to  dedicated  lines  were  used  for  this  purpose.  In  the  late  sixties  video  units 
became  available  and  made  dialog  between  user  and  system  possible,  this  dialog  was  fre¬ 
quently  effected  via  the  telephone  network.  At  first,  because  of  technical  limitations, 
the  lines  were  connected  to  the  CPU  by  means  of  hard-wired  control  units.  Teleprocessing 
accessing  methods  were  incorporated  in  the  operating  system  to  make  a  few  basic  functions 
available  to  the  user.  The  first  time-sharing  operating  systems  were  introduced  around 
1965*  Time-sharing  is  the  allocation  of  a  certain  amount  of  CP  time  to  each  user  in  turn. 
This  means  that,  in  effect,  each  user  has  the  complete  system  at  his  disposal.  This  mode 
of  operation  is  especially  suitable  for  interactive  programming  and  scientific  applica¬ 
tions  where  each  user  is  executing  his  own  specialized  jobs.  This  is,  however,  not  the 
case  for  commercial  applications  where  many  users  want  to  run  the  same  programs  and 
access  the  same  data  more  or  less  simultaneously.  Computerized  booking  systems  or  systems 
for  accessing  shared  data  bases  are  applications  of  this  kind.  Clearly  time-sharing  is 
not  appropriate  and  the  approach  adopted  must  be  the  serialization  of  individual  jobs  by 
special  system  software  components.  In  1970  the  first  TP  monitors  wete  developed  to  solve 
the  serialization  problem  as  well  as  other  problems  relating  to  safeguarding  and  program 
restart.  This  made  transaction  processing  possible  and  after  1975  the  most  important 
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transaction  functions  became  available  in  operating  systems.  Since  then  data  processing 
has  been  applied  to  more  and  more  areas  of  commerce  where  a  large  quantity  of  similar, 
repetitive  tasks  have  to  be  carried  out.  Over  the  same  period  of  time  the  number  of 
dedicated  lines  increased  so  rapidly  that  each  group  of  lines  had  to  be  controlled  by  a 
satellite  computer. 

Private  data  networks  were  set  up  shortly  afterwards  using  dedicated  lines  and  concen¬ 
trators,  today  for  example,  large  airlines  have  networks  interconnecting  thousands  of 
terminals  all  over  the  world.  The  telecommunications  laws  in  Europe  and  the  United  States 
are  very  different  and  this  is  why  the  data  networks  in  both  countries  have  followed 
different  lines  of  development.  In  Europe  the  administrations  have  a  telecommunications 
monopoly  as  private  citizens  are  prevented  by  law  from  setting  up.  telecommunication 
systems.  Thi*-  means  that  the  development  of  data  processing  in  Europe  was  heavily  de¬ 
pendent  on  the  development  of  public  networks.  West  Germany  is  typical  in  this  respect 
as  over  50J  of  terminals  are  operated  via  the  public  telephone  network.  Approximately 
40>  use  a  public  dedicated  line  network  which  was  set  up  by  the  telecommunications  ad¬ 
ministration  in  the  early  seventies  when  the  rapid  growth  of  private  line  communication 
systems  threatened  the  state  monopoly.  A  new  digital  dialing  network  (DATEX  L)  has  also 
been  set  up  recently.  A  milestone  in  the  development  of  public  networks  was  marked  by 
the  introduction  of  EURONET  at  the  end  of  the  last  decade.  In  1980  a  public  packet 
switched  network  (Datex  P)  went  into  trial  operation  in  West  Germany.  Table  1  gives  an 


overview  of  the  increasing  use 
Germany . 

of  data 

terminals  in  the 

various 

public  services  in  West 

Table  1 
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*v 
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Dedicated  line  network 

- 

- 
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Datex  network 

60 
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3548 

other  networks 

97 

2447 

7770 

4234 

Total  number: 

327 

4258 

37346 

107493 

In  the  United  States  teleprocessing  networks  operated  by  the  private  sector  and  avail¬ 
able  for  general  use  were  introduced  a  long  time  ago.  APRANET,  TELENET  and  TYMNET  are. 
typical  systems  of  this  kind.  New  techniques  such  as  packet  switching  were  tried  out  in 
these  networks  which,  as  a  rule,  were  only  available  to  certain  user  groups  -  research 
institutes  or  the  customers  of  service  computer  centres  for  example.  Generally  speaking 
these  networks  did  not  offer  a  complete  service  as  far  as  area  coverage  was  concerned. 
However,  because  the  market  for  high  performance  data  processing  has  grown  so  rapidly  in 
recent  years,  AR  &  T  and  other  large  organizations  are  considering  setting  up  data  pro- 
cessing  networks  with  full  area  coverage.  As  well  as  the  transfer  and  storage  of  data 
these  networks  will  also  carry  out  a  certain  amount  of  data  processing.  Attempts  are  also 
being  made  to  set  up  teleprocessing  networks  using  satellites.  The  main  characteristic  of 
such  systems  is  their  high  data  transmission  rate,  which  also  makes  other  services  such 
as  picture  transmission  or  video  conferences  feasible.  Because  the  cost  of  satellite 
ground  stations  is  still  relatively  high,  satellite  networks  will  only  be  accessible  to 
users  with  considerable  resources  at  their  disposal. 


During  the  seventies  the  developers  of  data  networks  in  Europe  lost  ground  to  their 
American  counterparts  because  they  were  prevented  from  setting  up  private  networks.  How* 
ever,  as  the  eighties  approached,  the  situation  changed  as  the  techniques  used  in  digital 
dialing  networks  had  reached  such  a  high  degree  of  development  that  they  could  be  made 
subject  to  international  standards.  After  the  CCITT  published  the  X-series  recommenda¬ 
tions  work  started  on  the  construction  of  public  digital  dialing  networks  in  most  Euro¬ 
pean  countries.  In  the  years  to  come  these  networks  will  offer  a  range  of  remarkable  new 
services,  which  could  also  be  important  for  information  and  documentation  centres: 

a)  TELETEX  offers  a  new  kind  of  telex  service  with  a  high  transmission  rate  and  a  large 
character  set.  This  service  is  primarily  intended  for  text  commune lat ion  and  office 
automation,  it  will  probably,  to  some  extent,  replace  the  existing  telex  service. 

The  result  of  this  will  be  the  development  of  a  new  type  of  intelligent  printer  ter¬ 
minal,  which  can  also  be  used  in  teleprocessing 
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b)  PRESTEL,  the  TELETEX  system  originally  developed  in  Britain,  is  now  being  tested  in 
several  European  countries.  In  this  system  inexpensive  user  terminals  are  connected 
via  the  existing  telephone  network.  Teletex  will  be  standardized  in  the  near  future 
and  is  particularly  well  suited  for  dialog  with  data  bases  and  for  booking  and 
ordering  services.  It  can  also  provide  full  area  coverage  with  transaction  proces¬ 
sing.  Experts  predict  that  travel  bookings,  mail  order  purchases  as  well  as  other 
financial  transactions  will  all  be  possible  from  one's  own  home  in  a  few  years  time. 
The  replacement  of  telephone  directories  by  very  low  cost  terminals  employing  a 
retrieval  system  is  an  idea  originating  from  the  French  Telecommunications  Authori¬ 
ties  which  is  worth  mentioning. 

c)  The  first  fully  digital  telephone  exchanges  have  been  operating  in  Great  Britain 
since  198O  (System  X).  In  addition  to  digital  telephone  traffic  the  exchanges  can 
also  carry  out  teleprocessing.  Agreement  about  the  standardization  of  European  digi¬ 
tal  telephone  systems  has  almost  been  reached  in  discussions  at  CEPT.  Using  this 
system  it  is  envisaged  that  speech  communication  will  take  place  at  64  Kbit/s., 
digital  traffic  is  also  possible  as  is  the  transition  to  other  public  networks.  The 
first  prototypes  using  these  new  techniques  are  expected  to  go  into  operation  around 
1983.  It  is  probable  that  several  European  telecommunication  authorities  will  start 
introducing  Integrated  Services  Digital  Networks  (ISDN)  in  1985 

d)  Experiments  on  wide  band  communication  using  co-axial  cables  or  optical  fibres  have 
been  carried  out  all  over  Europe.  These  systems  have  a  large  bandwidth  but  are  in 
general  unsatisfactory  for  dialog  traffic,  as  they  usually  have  a  tree  structure. 

e)  In  the  latter  half  of  this  decade  satellite  communications  will  become  increasingly 
important  in  Europe.  Telecommunications  authorities  are  also  considering  the  pos¬ 
sibility  of  making  wideband  services  available  to  the  private  user. 


In  conclusion  one  can  say  that  after  several  years  of  divergent  development  in  Europe 
and  the  United  States,  the  European  authorities  have  become  very  active  in  the  field  of 
public  data  networks.  This  would  lead  one  to  expect  that  users  will  soon  be  able  to 
benefit  from  the  advantages  of  public  networks,  i.e.  full  area  coverage,  guaranteed 
quality  of  service  and  standard  interfaces. 

The  variety  of  digital  networks  and  transmission  procedures  has  considerably  increased 
the  complexity  of  hardware  and  software.  Also,  because  of  the  rapid  development  of 
private  networks  in  the  USA  in  the  mid-seventies,  almost  all  the  important  manufacturers 
of  data  processing  equipment  have  their  own  "network  architectures",  which  are,  in 
general,  not  mutually  compatible.  The  ISO  has  been  trying  to  gain  acceptance  for  the 
standardization  of  a  reference  model,  which  describes  the  functions  of  teleprocessing 
systems  in  terms  of  function  layers.  CEPT,  CC1TT  and  ISO  have  begun  to  publish  recom¬ 
mendations  and  standards  for  teleprocessing  systems  based  on  this  model  and  it  is  hoped 
that  this  will  improve  the  compatibility  of  data  processing  systems  in  the  years  to  come. 
No  technological  advance  mentioned  previously  has  greater  significance  for  teleproces¬ 
sing  than  the  microprocessor.  Without  the  microprocessor  it  would  not  have  been  possible 
to  construct  intelligent  terminals  which  have  all  the  characteristics  of  a  small  com¬ 
puter  system.  These  devices  are  very  common  today  in  personal  computers  and  the  number 
of  applications  is  increasing  rapidly.  In  the  future  they  will  be  equipped  with  their  own 
small  c'ata  bases  opening  up  further  possibilities  for  the  user. 


4.  Information  Storage  and  Retrieval 


At  the  end  of  the  sixties  information  storage  and  retrieval  systems  were  still  in  a  rudi¬ 
mentary  stage  of  development.  Higher  functions  could  not  be  realized  as  the  required  CPU 
speeds  were  not  available  and  economic  problems  often  arose  because  memory  costs  were 
high.  However  the  situation  has  changed  dramatically  in  the  course  of  the  last  10  years. 
Modern  data  processing  systems  can  effectively  store  a  very  large  amount  of  data  and  are 
able  to  accommodate  a  large  number  of  users  simultaneously. 


In  the  seventies  the  development  of  information  storage  and  retrieval  was  determined  by 
three  factors: 


1)  -  Batch  processing  in  a  computer  centre  was  the  usual  mode  of  operation  in  the  sixties. 

However  1970  saw  a  breakthrough  in  the  form  of  on-line  systems  which  could  be  used  by 
many  users  in  dialog  operation. 

2)  -  The  advances  made  in  teleprocessing  and  network  design  led  to  the  formation  of  extensive 

information  networks  such  as  ARPANET,  TYMNET  or  EURONET.  In  their  turn  these  develop¬ 
ments  stimulated  the  construction  of  new,  public  switched  data  networks  in  Europe. 

3)  -  While  large  data  bases  were  being  developed  and  set  up  for  use  in  many  areas  of  science 

and  technology,  an  analogous  application  of  teleprocessing  to  information  storage  and 
retrieval  was  also  taking  place. 
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To  recapitulate  we  can  say  that  information  storage  and  retrieval  systems  have  become 
commercially  viable  in  recent  years  and  that  an  information  industry  has  been  created 
which  will  be  probably  be  a  sine  qua  non  for  future  economic  development. 

Most  commercial  information  storage  and  retrieval  systems  in  operation  today  are  de¬ 
signed  using  principles  that  were  already  familiar  10  years  ago.  This  means  that  prac¬ 
tically  all  storage  and  retrieval  algorithms  are  based  on  Boolean  logic.  The  conclusion 
that  no  radical  change  has  taken  place  in  this  field  over  this  period  is  inescapable. 
Mathematical  procedures  for  measuring  the  similarity  of  data  had  already  been  proposed 
at  the  beginning  of  the  seventies,  however  they  could  not  be  realized  in  practice  because 
the  demands  they  made  on  computer  resources  were  too  great.  The  increasing  quantity  of 
documentation  which  has  to  be  stored  in  modern  retrieval  systems  is  one  cogent  reason 
for  assuming  that  these  methods  will  not  find  much  favour  in  the  near  future. 

The  automatic  indexing  of  information  has  not  proved  to  be  commercially  viable  even  though 
several  effective  programs  have  been  developed.  On  the  other  hand  a  commercial  break¬ 
through  in  the  near  future  is  likely. 

Linguistic  data  processing  methods  have  been  implemented  in  experimental  systems,  but  as 
yet  no  wide  practical  application  for  them  has  been  found.  Nevertheless  proofs  exist 
showing  what  the  exact  limits  of  such  methods  are.  Methods  of  evaluating  the  semantic 
content  of  information  will  not  be  available  in  the  foreseeable  future. 

The  use  of  commercial  data  base  systems  has  grown  rapidly  in  the  seventies  and  to-day 
the  use  of  data  bases  is  common  practice.  Over  the  last  10  years  progress  has  resulted 
from  research  into  various  data  models,  of  which  the  relational  model  has  made  the  great¬ 
est  impression.  The  main  area  of  debate  for  experts  in  the  field  has  been  the  standard¬ 
ization  of  data  base  interfaces.  CODASYL  was  the  starting  point  for  the  discussion  and 
is  supposed  to  facilitate  the  portability  of  user  programmes  for  various  data  base 
systems.  It  is  worth  noting  that  standardization  was  limited  to  the  data  base  systems  as 
the  user  interfaces  of  information  storage  and  retrieval  systems  are,  in  general,  used 
in  processing  dialog  operation  and  this  meant  that  the  standardization  of  programming 
interfaces  did  not  seem  to  have  the  same  importance.  New  developments  in  data  base 
systems  were  also  accompanied  by  greater  ease  of  use,  providing  numerous  aids  that  make 
life  simpler  for  the  system  user  whether  he  is  an  application  programmer  or  a  dialog  sub¬ 
scriber.  These  aids  include  interactive  query  languages,  report  generators  and  data  dic¬ 
tionaries.  Automated  data  base  design  has  also  simplified  the  complex  problems  encoun¬ 
tered  in  this  field. 

Two  main  trends  can  be  seen  in  the  future  development  of  data  base  technology  -  the  inple- 
mentation  of  extremely  large  data  base  systems  and  data  base  decentralization. 

Even  today  the  problems  associated  with  "distributed  data  bases"  have  not  been  fully 
solved.  There  is  a  need  for  systems  in  which  data  is  stored  at  many  locations  in  a  com¬ 
puter  network,  furthermore  all  the  network  subscribers  should  be  able  to  access  and  up¬ 
date  all  this  data.  Many  limited  versions  of  these  systems  have  already  been  realized, 
however  no  one  has  yet  succeeded  in  designing  a  system  that  has  all  the  characteristics 
of  a  single  large  data  base.  This  requires  the  transfer  of  an  extremely  large  amount  of 
data  between  the  various  computers  in  the  system  which  in  turn  necessitates  powerful 
CPUs.  Judging  by  the  present  development  of  public  data  networks  it  seems  likely  that 
data  transfer  costs  will  decrease  sharply  and  that  sufficiently  high  transmission  rates 
will  be  obtained  in  the  years  to  come. 

Remarkable  progress  has  been  made  in  the  data  acquisition  of  information  systems: 

Machines  can  now  read  typewritten  text  or  text  written  in  capitals  at  a  much  lower  cost 
than  before.  This  means  that  devices  which  are  capable  of  processing  written  information 
more  efficiently  than  ever  before  will  soon  be  appearing  on  the  market. 

Software  systems  which  can  analyse  natural  language  could  form  the  basis  for  the  auto¬ 
matic  extraction  of  descriptors  from  documents,  it  should  also  be  possible  to  construct 
dialog  systems  which  can  communicate  with  data  bases  in  natural  language. 
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RESUME 

On  rappelle  le  contexte  gdndral  de  Involution  vers  1 'fere  de  1 ' information. 
Tout  converge  pour  montrer  que  le  marchb  de  1 ’ information  sera  porteur,  si  nous  savons 
construire  de  bons  systdmes  d ' in  forma t ion  et  agir  dans  ce  but  sur  les  facteurs  d'organi- 
sation,  les  facteurs  politiques,  les  facteurs  bconomiques  et  les  facteurs  juridiques. 

Ces  quatre  points  ne  sont  pas  traitds  exhaustivement ,  mais  on  s'effor ce  de  mettre  en 
relief  certaines  actions  considbrdes  comme  prior i tai res ,  signaler  des  dcueils  6  dviter, 
et  prbparer  ainsi  dfes  b  present  les  systbmes  d ’ in  formation  des  anndes  1980. 


CONTEXTE  GENERAL  DE  L'EUOLUTION 

Souvenez- vous  des  annbes  60.  A  cette  bpoque  le  systbme  d ' information  btait 
prdsentd  comme  un  tout  parfaitement  autonome  (Planche  1)  :  une  population  d 'utilisateurs 
et  un  corpus  censd  couvrir  les  besoins  de  cette  population.  L'accent  se  portait  sur  les 
notions  de  rappel  et  de  precision,  avec  les  tests  de  Cranfield,  chers  b  Cleverdon,  abon- 
damment  discutds  en  1965  au  "Nato  Advanced  Study  Group"  de  la  Haye  sur  l'dvalu ation  des 
systfemes  d  '  information  . 

Les  systbmes  d ' in f orma t ion  dea  anndes  BO  seront  ceux  qu'annonce  le  rapport 
Nora-Minc  (1)  dont  je  ne  citerai  que  de  courts  extraits  s  "1  * informatique  de  masse 
irriguera  demain  la  socibtd  comme  le  fait  1 'dlectricitd  (...)  A  la  difference  de  l'dlec- 
tricitd,  la  tdlbmatique  ne  vdhiculera  pas  un  courant  inerte  mais  de  1 ' information , 
e'est-b-dire  du  pouvoir,  Elle  modifiers  le  comportement  des  organisations  et  de  la 
socidtd  toute  entibre 

On  pourrait  aussi  rappeler  la  thfese  de  Porat  (2)  ou  bien  les  etudes 
et  de  l'OCDE  qui  montrent  que  le  taux  de  croissance  du  secteur  informationnel 
restera  beaucoup  plus  61ev6  que  celui  des  autres  secteurs,  qu'on  le  mesure  en 
d'emploi  ou  de  valeur  ajoutee  (Planche  2). 

Les  systbmes  des  anndes  90,  auxquels  nous  aliens  nous  preparer  pendant  la 
decennie  80,  seront  probablement  ceux  qu'annonce  Lancaster  (3)  (Planche  3)  lequel, 
s'appuyant  sur  une  solide  enqu6te  de  Delphes,  nous  predit  que  le  Scientifique  aura  un 
terminal  b  son  domicile,  un  terminal  sur  son  lieu  de  travail...  et  un  terminal  qu'il 
transporters  dans  3es  deplacements,  pour  ne  pas  se  couper  de  1 ' information  puisqu'i 1 
aura  b  prendre  des  notes,  b  bchanger  de  la  correspondence,  b  publier,  b  consulter  des 
banques  de  donnees  primaires  et  secondaires,  le  tout  par  voie  electronique . 

En  attendant,  les  systfemes  des  anndes  80  vont  assurer  la  transition  et  seront 
du  type  dberit  par  la  Planche  A,  e'est-b-dire  que  des  courtiers  ou  des  cellules  de  profes 
sionnel8  de  1 ' information  vont  s'interposer  entre  1 ' ut i 1 isateur  et  les  sources  d'infor- 
mation,  tout  le  temps  qu'il  faudra  a  cet  utilisateur  pour  apprendre  b  purser  lui-mfime 
dans  les  ressources  qui  lui  sont  offertes  b  profusion  mais  de  fagon  plus  ou  moins  adroite 
Nous  allons  avoir  b  agir  sur  les  facteurs  d ' organisation ,  politiques,  dconomiqu',s  et  juri 
diques..  1 'ensemble  des  facteurs  humains  qui  inf luenceront  cette  Evolution  :  vaste 
programme  qui  nous  attend,  et  dont  nous  allons  dvoquer  les  diffdrentes  facettes. 


1.  FACTEURS  D’ORGANISATION 


de  Machlup 
est  et 
termes 


UNISIST 

II  s'agit  tout  d'abord  de  ne  pas  perdre  de  vue  le  programme  UNISIST  (4).  Le 
systbme  mondial  d' information  scientifique  dtudid  et  publid  par  l'UNESCO  en  1971  dnumbre 
des  objectifs.  Les  principes  et  recommendations  qu'on  y  trouve  n'ont  pas  vieilli  dix  ans 
plus  tard,  en  ddpit  de  Involution  technologique ,  ce  qui  est  plutdt  raasurant.  Le  concept 
NATIS*  permet  de  trouver  dans  chaque  nation  1 '  inter locuteur  capable  d'agir  sur  les  fac¬ 
teurs  d 'organisation,  dans  un  cadre  coordonnd,  tout  en  tenant  compte  dea  prioritds  apdei- 
f iquea . 


NATIS  :  National  Information  System 
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5ENSIBXL ISAT I DN  POUR  UNE  RESTRUCTURATION 

Mais  l'organe  national  de  coordination  (par  exemple  la  MIDIST*  en  France) 
s'dpuisera  vite  si  son  action  ne  suscite  que  1 ' indi f f drence  et  ne  petit  pdndtrer  la 
couche  des  cadres  supdrieurs.  Auerbach  et  Slamecka  (5)  soulignent  qu'aux  Etats-Unis, 
il  faut  provoquer  au  plus  haut  niveau  des  Entreprisea  et  de  1 'Administration,  une 
prise  de  conscience  de  la  signification  et  de  la  valeur  de  1 ' information  et  du  potentiel 
offert  par  les  systAmes  d ' in  format  ion  pour  la  stratdgie  d'entreprise  et  la  conduite 
des  dchanges  interna tionaux .  II  faut  done  alerter  les  dirigeants  et  mettre  en  place  un 
programme  et  des  moyens  de  sensibilisation  :  organiser  pour  les  cadres  supdrieurs  de 
l'ordre  administratif  et  technique  des  sdminaires,  visites,  exposds,  conferences,  ddmons- 
trations...  et  une  formation  montrant  la  valeur  de  1 1  information,  son  organisation  et 
les  moyens  d'y  acedder.  II  faut  arriver  &  une  restructuration  des  entreprises  qui  prenne 
en  compte  1 ' in  format  ion  au  niveau  voulu  et  order  le  rdseau  hidrarchisd  d ' information 
dans  1 ' entreprise . 


SVSTEME 


II  faut  aussi  rappeler  A  ces  cadres  que  diriger,  piloter  ou  gouverner,  e'est 
faire  de  la  cyberndtique  et  n'oublions  pas  que  la  cyberndtique  a  dtd  ddfinie  tantflt  comme 
la  science  de  1 '  in  forma tion ,  tantflt  comme  l'art  de  rendre  l'action  efficace,  et  que  tout 
syst&me  n'dvolue  bien  que  si  l'on  sait  canaliser  la  rdtroaction.  Ceci  est  important 
parce  que  la  cyberndtique  continuers  A 
empidter  sur  l'humain,  et  il  n'y  a  pas  lieu 
de  s'en  effrayer.  En  effet,  A  partir  de  la 
reprdsentation  que  propose  Aurel  David  (6) 
on  peut  imaginer  M  (les  facteurs  non  techniques, 
ou  l'outil,  ou  la  machine)  progreasant  vers  A 
(Animus  :  l'esprit...  les  facteurs  non  techniques) 
pour  prendre  en  charge  progressivement  des  fonctions 
nouvelles  de  pilotage,  sans  danger  pour  A  considdrd 
comme  le  point  de  rencontre  de  2  droites  se  coupant 
A  l'infini.  A  gardant  ainsi  toujours  sa  signification 
transcendantale  par  rapport  A  l'outil, 

Il  faut  done  savoir  analyser  le  systfeme,  en  le  ddcoupant  en  autant  de  parcelles 
qu'il  est  possible,  comme  le  veut  Descartes,  mais  sans  perdre  de  vue  que  le  tout  est  supd- 
rieur  A  la  somme  des  parties,  comme  le  rappellent  Aristote  ou  Platon.  Je  proposerais  que 
soit  gravde  partout  la  definition  du  systkme  don t  les  implications  sont  fondamentales 
pour  une  saine  dvolution  s  "ensemble  de  conception  homogfene  auquel  est  attribude  une 
mission  prdcise  dont  1 ' ut i 1 i sa teur  doit  parfsitement  connaltre  les  possibilitds  et  1 ' em- 
ploi  optimal,  congu  comme  un  modAle  perfectible,  avec  recherche,  A  chaque  instant,  du 
meilleur  rdsultat,  par  une  succession  de  choix  et  de  compromis  dietds  par  des  rapports 
de  coOt/ef f icacitd  en  tenant  compte  des  contraintes  de  tous  ordres", 


DECENTRALISATION 


La  ndcessitd  d'une  vision  globale  n'implique  pas  la  centralisation,  Il  ne 
faut  pas  que  des  organisations  se  scldrosent  et  se  fossilisent  dans  leur  "Qudte  Mythique 
du  Grand  SystAme  de  Gestion  Intdgrde",  sans  voir  que  les  logiciels  complexes  et  rigides 
ne  peuvent  dtre  que  morts-nds,  face  au  reddploiement  constant  des  organisations  humaines 
en  environnement  changeant,  Alain  Madec  (7)  cite  cet  ingdnieur  attardd  qui,  ddsireux  de 
contrarier  l'dvolution  informatique  vers  la  ddcentral isation  des  fonction3  disait  : 

"Si  Dieu  avait  voulu  que  1 ' intelligence  soit  distribude,  il  nous  aurait  mis  des  cerveaux 
aux  poignets .  .  . " .  Dans  les  anndes  80  se  poursuivra  cette  ddcentralisation  informatique 
faisant  appel  aux  terminaux  intel 1 igents ,  A  la  mini-informatique,  et  A  1 ’ informatique 
rdpartie . 


MIDIST  :  Mission  Interministdrielle  de  1 ' Information  Scientifique  et  Technique 
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SEPARATION  DES  FONCTIONS 


Par  opposition  au  systAme  des  annAes  60,  on  observe  dAjA  en  61  une  nAcessaire 
dif fArenciation  des  fonctions.  comme  ce  fut  le  cas  dans  l'Avolution  du  rAgne  animal. 


-  Fonction 

-  Fonction 

-  Fonction 

-  Fonction 

-  Fonction 


Etudes  (Besoins,  MarchA,  Promotion,  Prospective). 

Producteur  de  documents  primaires. 

Producteur  de  banques  de  donnAes  textuelles  ou  factuelles. 

Serveur  ( regroupement  et  mise  A  disposition  des  banques  de  donnAes) 

Courtier  (recensement  des  sources  -  accAs  A  ces  sources  -  formation 
des  ut il isa teurs  )  . 


-  Fonction  Formation. 

-  Fonction  Pilotage  (utilisation  de  la  retroaction  pour  agir  but  les  objectifs). 


Fonction  Etudes 

II  est  fondamental  de  se  situer  vis-A-vis  des  utilisateurs  potentials, d ' avoir 
une  politique  coromerciale  en  fonction  de  1 'etude  du  marche,  et  de  conquerir  cette 
clientele  tant  par  I'efficacite  (dAlais-compAtence )  que  par  une  avance  sur  les  concur¬ 
rents  (promotion-prospective). 


Fonction  Producteur  de  documents  primaires 

Auteurs  et  editeurs  doivent  6tre  rAmunArAa  et  fortement  encourages.  II  ne  faut 
pas  order  de  structure  particuliAre  pour  produire  1 ' information  nouvelle,  mais  au  con- 
traire  utiliser  et  encourager  les  structures  existante3  :  Centres  Techniques,  Associa¬ 
tions  ou  Federations  d 'Associations  d ' Ingenieurs ,  Laboratoires  universitaires. . .  Ce 
sont  les  milieux  qui  secrAtent  normalement  1 ' informs t ion ,  s’ils  sont  aides,  encourages, 
orientes  et  leur  production  canalisAe.  La  production  de  documents  primaires  est  le  point 
sur  lequel  doit  porter  l'effort  de  chaque  gouvernement  puisqu'il  s'agit  du  contenu  des 
systAmes  A  construire,  qui  sera  bon  ou  mediocre  selon  que  le  scientifique  sera  sollicite, 
encourage,  aide,  dans  le  cadre  approprie,  A  produire  thAses,  rapports,  articles,  livres, 
brevets,  mAmoires  de  presentation  dans  les  CongrAs...  ou  qu’il  sera  incite  A  se  teire 
et  A  rentrer  dans  sa  coquille.  L'adhAsion  et  la  participation  active  A  une  Association 
au  moins  devraient  Atre  considArAes  comme  une  nAcessitA  pour  chaque  chercheur  ou  ingAnieur. 
C'est  le  cas  des  Etats-Unis  qui  peuvent  nous  montrer  des  exemples  A  mediter  :  American 
Chemical  Society  -  American  Society  for  Metals,  Institute  of  Electrical  &  Electronic 
Engineers,  Engineers  Joint  Council...  qui  ont  organise  leur  systAme  de  publications  pri¬ 
maires  et  secondaires,  et  le  revendent  dans  le  monde. 


Fonctions  Producteur  de  banques  de  donnAes  textuelles  ou  factuelles 

Si  l'on  decide  d'entrer  sur  le  marchA  de  1 ' information  secondaire,  on  doit 
dAfinir  le  secteur,  rechercher  des  appuis  (coopArants)  pour  couvrir  ce  secteur,  norma- 
liser  le  format  et  1 ’ information,  imaginer  des  sous  produits  (magnAtiques  ou  papier) 
et  dAfinir  des  objectifs  de  clientele  (serveurs  pour  les  produits  magnAtiques,  Aditeurs 
pour  les  produits  papier),  LA  encore,  les  Centres  Techniques  et  Associations  d'IngAnieurs 
sont  probablement  les  points  d'ancrage  d’une  action  efficace  et  durable. 

L'idAal  est  que  le  fichier  ou  la  banque  de  donnAes  soit  produit  par  ceux  qui 
en  seront  les  principsux  utilisateurs  et  qui  sauront  le  mieux  orienter  sor.  ontenu  et 
assurer  la  formation  des  autres  utilisateurs. 

Fonction  Serveur 

II  importe  que  les  banques  de  donnAes  textuelles  ainsi  produites  soient 
placAes  et  regroupAes  le  mieux  possible  sur  des  serveurs  peu  nombreux.  L 'utilisateur 
sera  ravi  de  pouvoir  trouver  son  information  sur  un  seul  site  au  lieu  d'avoir  A  faire 
le  tour  de  tous  les  fichiers  de  tous  les  serveurs.  L'effet  de  synergie  est  fondamental. 

Pour  les  banques  de  donnAes  factuelles  au  contraire,  on  peut  admettre  qu'elles 
restent  sur  le  site  de  leur  creation  et  de  leur  mise  A  jour  puisque,  dans  la  plupart 
de3  cas,  le3  problAmes  posAs  devront  etre  AtudiAs  par  les  spAcialistes  charges  de  l'en- 
tretien  de  la  banque,  pour  conduire  A  des  strategies  de  recherche  et  A  des  rAponses 
analysAes  et  interprAtAes  de  fagon  satisfaisante. 
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Fonction  Courtier 

Ce  n'est  pas  encore  dans  les  anndes  80  que  les  ut  i.1  isateurs  traiteront 
eux-mduies  tous  leurs  probldmes  d '  information.  Pendant  cette  pdriode  de  transition, 
les  courtiers  vont  mdme  se  multiplier.  Entendons  par  Id  les  organismes  qui  assurent 
l'interface  entre  1 1 ut i 1 isateur  final  et  les  sources  d ' inf ormation .  Ce  peut  etre  le 
service  d 1  information  interne  &  1 '  entreprise ,  qui  s'est  connects  aux  sources  accessibles 
dont  il  connalt  les  fichiers,  les  procedures  et  les  difficultds  d'accds...  ou  un  Organisms 
extdrieur  d  l'entreprise  qui  se  charge  d'apporter  la  rdponse  si  on  lui  explique  le  pro- 
bldme  (exemple  NERAC*).  En  attendant  que  "1  'utilisateur  final"  soit  tout  &  fait  adulte 
et  competent,  il  y  a  encore  de  beaux  jours  pour  les  courtiers. 

Fonction  Formation  . 

Cette  fonction  ne  peut  pas  continuer  A  etre  confide  seulement  aux  fournisseurs 
de  banques  de  donndes  au  aux  courtiers.  Elle  doit  etre  prise  en  main  par  les  Gouvernements 
suivant  un  programme  A  etablir. 


Fonction  Pilotage 

C'est  5  partir  des  reactions  des  usagers,  encore  une  fois  que  seront  prises 
les  decisions  d'acheter  telle  ou  telle  banque  de  donnees,  ou  seulement  d'y  accdder, 
directement  ou  par  1 ' intermediaire  de  tel  ou  tel  courtier.  L’ennui  si  1 1  on  ddldgue  la 
fonction  recherche  d 1  information  A  un  courtier  c'est  qu’on  court  le  risque  de  ne  jamais 
impliquer  1 ' ut i lisa teur  final  et  canaliser  sa  reaction,  alors  que  si  cette  fonction  est 
interne  A  l'entreprise  on  aura  de  bons  rdsultats  grace  au  couple  utilisateur  final  et 
documentaliste . . .  et  peu  A  peu  si  des  programmes  de  formation  sont  mis  en  place,  si  les 
logiciels  permettent  des  seeds  plus  faciles,  et  si  une  nouvelle  generation  d '  ingdnieurs , 
ddjd  sensibilisde  en  dcole  et  par  l'emploi  des  systdmes  d ' information  grand  public  emerge, 
il  n'est  pas  impassible  que  les  courtiers  disparaissent  et  que  l'on  arrive  progressi vement 
au  systdme  envisage  par  Lancaster...  Pour  assurer  la  fonction  Pilotage,  des  regroupements 
seront  encourages  :  Industrie  de  1  '  in  format  ion ,  Groupe  de  producteurs  de  banques  de  donnees, 
Association  d '  utilisateurs  dont  les  diverses  pressions  retabliront  certains  equilibres. 

Ceci  est  fondamental,  puisque  encore  une  fois,  piloter  c'est  utiliser  la  retroaction. 


2.  FACTEURS  POLITIQUES 

Une  bonne  part  de  1 'action  politique  est  ddjd  dietde  par  les  recommendations 
UNISIST,  et  une  politique  s'elabore  au  niveau  europeen,  ce  qui  permet  d'eviter  en  Europe 
les  initiatives  isoldes  et  les  cloisannements  inutiles.  11  appartient  bien  entendu  A 
chaque  Gouvernement  d'apprdcier  ses  prioritds,  mais  en  se  plagant  dans  le  cadre  trace. 

GOUVERNEMENT  ET  SECTEUR  PRIVE 

A  chacun  son  rflle  :  le  Gouvernement  gouverne,  e'est-d-dire  fixe  les  objectifs 
en  fonction  des  moyens,  favorise  ou  subventionne  les  actions  prioritaires,  ddfinit  une 
politique  de  1 ' information  et  agit  d  ce  titre  sur  les  structures  et  les  organisations 
pour  faciliter  1 'execution  de  cette  politique,  ddveloppe  la  recherche  en  science  de 
1 ' information  pour  en  tirer  de  nouveaux  objectifs  pour  I'avenir. 

Le  secteur  privd  quant  &  lui,  se  voit  confier  la  realisation,  l'execution,  la 
mise  en  place  de  systdmes  viables,  efficaces,  gdndrateurs  de  biens  et  notamment  de  profit. 
Le  3ecteur  privd  est  mieux  placd  pour  assurer  la  gestion,  sauf  si  un  impdratif  s'y  oppose 
(exemple  du  Secret  de  Defense).  Aux  Etats-Unis,  ce  principe  est  clairement  exprime  par 
la  directive  A76  selon  laquelle  le  Gouvernement  federal  s'appuiera  sur  le  secteur  privd 
pour  publier  et  dissdminer  1 ' information  scientifique  et  technique,  sauf  s'il  est  dans 
l'intdrdt  national  d'agir  autrement  (8). 

Le  secteur  privd  tend  d  considdrer  1  '  information  comme  un  bien  dconomique, 
qui  peut  dtre  distribud  suivant  les  mdcanismes  du  marchd,  comme  tous  les  autres  biens 
et  services. 

Les  Gouvernements  considdrent  qu'il  faut  preserver,  inddpendamment  des  lois  du 
marchd,  un  accds  equitable  d  1 ' informs tion.  Ceci  n'est  pas  seulement  un  droit  mais  c’est 
ndces3aire  d  la  preservation  d'une  socidtd  libre. 

On  retombe  ici  sur  le  Freedom  of  Information  Act  qui  a  permis  aux  Etats-Unis 
de  ddplacer  la  frontidre  commerciale  du  marchd  et  Is  frontidre  juridique  du  secret  (17). 
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On  considAre  gAnAralement  que  les  instances  internationales  et  les  gouwernements 
ne  se  dessaisissent  pas  su f f isamment  d'activitAs  d 1  in  format  ion .  Aitchison  (9)  remarque 
qu'aux  Etats-Unis  l'industrie  de  1 ' information  est  plus  forte  et  plus  diversifiAe  qu'en 
Europe.  On  pourrait  en  dAbattre,  en  notant  toutefois  que  l'Amergence  de  l'industrie  de 
1 'information  a  AtA  un  phAnomAne  pAriphArique  et  une  retombAe  de  la  solide  organisation 
et  prise  en  main  de  1 ' information  aux  Etats-Unis  par  le  gouvernement  (NASA  -  NSf  -  NTIS  - 
etc...).  L'industrie  est  en  quelque  sorte  le  reuendeur,  vers  un  marchA  plus  large, 
national  ou  international,  d 'une  information  collectAe  A  une  autre  fin.  Par  ailleurs, 

1 ' organisation  de  1 ' information  est  calquAe  sur  1 ' organisat ion  de  la  recherche  et  des 
differences  notables  existent  sur  ce  plan  entre  Europe  et  Etats-Unis.  II  est  done  possible 
que  des  reorganisations  du  secteur  recherche  entrainent  ipso  facto  une  redistribution 
des  activitAs  d ' in  formation  entre  gouvernement  et  secteur  privA. 

COMPETITION 

II  me  semble  done  qu'on  ne  peut  pas  parler  de  competition  entre  gouvernement 
et  3ecteur  prive.  On  doit  au  contraire  encourager  la  competition  et  I 'emulation  entre 
organisations  du  secteur  prive  -  ne  pas  favoriser  les  situations  de  monopole,  ou  de 
partage  A  1'amiable  du  marche,  situations  qui  conduisent  A  la  stagnation  et  A  l'autosatis- 
faction.  C'est  probablement  la  raison  pour  laquelle  les  systAmes  d ' information  emt  si 
peu  evolue  au  cours  des  derniferes  annAes  s  un  produit  relativement  mediocre  se  vend. 
Pourquoi  prendre  des  risques  en  le  perfectionnant .  C’est  ce  que  j'ai  cru  comprendre  du 
discours  de  clOture  de  Salton  au  dernier  congrAs  de  Cranfield,  attitude  un  peu  plus 
nuancAe  lorsqu'il  dit  "super f iciel lement ,  il  peut  sembler  qu'il  ne  s’est  rien  produit 
dans  les  annAes  rAcentes.  AprAs  tout  le  fichier  inverse  et  la  strategic  boolAenne  remon¬ 
tent  A  l'Apoque  de  la  carte  perforAe  et  existent  toujours  dans  la  plupart  des  systAmes... 
mais  une  observation  plus  precise  montre  que  les  choses  ont  tout  de  mAme  change"... 
et  il  rappelle  que  de  nouveaux  algorithmes  permettent  d'aller  au-delA  (10)  ce  qui  est 
vrai  pour  SMART,  SPIRIT  et  d'autres  logiciels  permettant  de  balayer  un  texte  avec  clas- 
sement  de  pertinence  A  la  sortie  et  dans  des  conditions  plus  proches  du  langage  naturel 
de  celui  qui  interroge.  En  d'autres  termes  la  competition  sur  le  terrain  du  logiciel 
doit  se  dAvelopper.  Il  faut  done  probablement  se  rAjouir  de  voir  par  exemple  que  la 
Commission  FAdArale  des  Telecommunications  (FCC)  a  dAclarA  que  la  concurrence  Atait  dAsor- 
mais  libre  sur  le  terrain  de  la  tAlAmatique  entre  ATT  (ou  le  Bell  System)  et  IBM  ;  ATT 
avait  un  quasi  monopole  tAlAphonique  mais  ne  pouvait  pAnAtrer  jusqu'ici  sur  un  territoire 
qu'IBM  considArait  comme  le  sien,  et  ceci  n'est  qu'un  exemple  d'un  vaste  mouvement  poli¬ 
tique  et  de  reorganisation  qui  tend  A  une  redistribution  des  forces  vi-A-vis  du  secteur 
de  1 ' information  et  de  ses  supports,  aux  Etats-Unis  comme  en  Europe. 

NORMALISATION 

Un  facteur  politique  intervient  nAcessai rement  vis-A-vis  de  la  normalisation. 

Je  partagerais  volontiers  les  rAgles  d'or  AnoncAes  par  Anderla  (11)  s  la  normalisation 
ne  doit  pas  freiner  la  crAativitA,  elle  doit  s’en  tenir  A  l'essentiel  et  assurer  ou  per- 
mettre  une  certaine  stabilitA,  tenir  compte  de  toutes  les  parties  intAressAes  et  de  leur 
avis,  et  permettre  une  evolution  sans  hiatus. 


Nous  savons  que  1 ' information  est  une  denrAe  trAs  spAciale,  imma t Ariel le ,  qui 
se  valorise  en  fonction  de  son  utilisation,  au  contraire  de3  autres  biens,  et  qu’on  ne 
sait  pas  Avaluer,  de  sorte  qu'on  Avalue  souvent  son  support.  Le  seul  critAre  est  de  la 
rattacher  et  de  la  comparer  au  secteur  auquel  elle  s'applique  et  de  se  demander  :  que 
peut-on  consacrer  A  1 ' information  scientifique  vis-A-vis  de  l'enveloppe  recherche,  A 
1 ’ information  Aconomique  vis-A-vis  de  la  compAtition  internationale,  A  1 ' information 
sur  1 'Anergie  vis-A-vis  des  problAmes  d'Anergie,  etc...  plutOt  que  de  ramener  la  valeur 
de  1 ' information  au  coQt  du  traitement  ou  du  support  en  oubliant  que  1 ' information  est 
aujourd'hui  le  premie"  enjeu  dans  la  compAtition  internationale. 

Si  l'on  trouve  que  1 ' information  coQte  cher,  on  peut  dAgager  des  principes 
d'action  qui  en  diminueront  le  coOt  ou  la  valoriseront  davantage  : 

1.  Rendre  quasiment  impossible  la  recherche  conduite  sans  examen  prAalable  de 
1 ' information  dAjA  disponible  sur  le  mfime  sujet  (par  exemple  pression  NSF*  pour  pousser 
A  1 'utilisation  de  la  banque  SSIE  en  liant  le  financement  de  la  recherche  A  une  telle 
participation)  et  introduire  dans  les  contrats  de  recherche  une  clause  obligatoire  de 
recherche  bibl iographique . 

2.  CrAer  en  Europe  des  centres  de  redistribution  de  1 ' inf orma t ion  du  type  NTIS. 
Le  systAme  SIGLE**  pour  la  littArature  grise  est  un  exemple  A  suivre... 


National  Science  Foundation. 

*  • 

System  for  Information  on  Grey  Literature  in  Europe 


3.  Eviter  le  contresena  qui  conduirait  a  tout  sacrifier  b  1 '  informatique  en 
ndgligeant  la  finalitd  qu'est  le  flux  de  donndes.  La  technique  n’est  que  le  contenant. 
Nous  aliens  vers  la  construction  d'un  nouvel  dquilibre  mondial  oij  lea  dchanges  d'infor- 
mation  vont  jouer  un  rOle  central  dans  le  monde  o£i  1  1  interdependence  des  Etats  ne  cesse 
de  croltre.  Les  facteurs  en  presence  sont  d’une  part  la  libre  circulation  de  1' informa¬ 
tion  (au  nom  de  la  liberty  d'expression  et  du  libre  dchange)  et  d'autre  part  la  responsa- 
bilitd  des  dtats  souverains  recherchant  un  dquilibre  resultant  d'dchanges  mutuellement 
profitables . 


MARCHE 

Appleyard  rappelle  qu'en  trouvait  aux  Etats-Unis  14  000  terminaux  en  1969  : 
on  en  trouve  2  millions  aujourd'hui  et  on  en  prdvoit  10  millions  en  1989,  mais  il  y  a 
ddjb  aux  Etats-Unis  130  millions  de  postes  de  tdldvision  qui  feront  partie  du  rdseau 
intdractif  de  demain.  C'est  done  un  changement  d'dchelle  considerable  pour  l'industrie 
et  pour  l'emploi. 

Un  autre  facteur  d'expansion  est  le  fait  que  la  production  documentaire 
augmente,  dans  les  domaines  non  techniques  ou  scient i f iques  et  notamment  l'dconomie, 
ainsi  que  les  besoins  d 1  in  formation ,  et  couvre  rapidement  tous  les  domaines  de  l'activitd 
humaine . 

L'avenir  offre  done  des  opportunity  considdrables  tant  pour  les  banques  de 
donndes  que  pour  les  logiciels  et  les  matdriels  puisque  la  notion  d '  architecture  de 
rdseau  tend  a  obliger  b  considdrer  cos  aspects  comme  un  tout.  Sur  l'ensembie  de  ces 
points,  il  importe  d'dviter  une  domination  d'une  entitd  politique  par  une  autre,  ce 
qui  serait  dangereux*. 

POLITIQUE  DE  PRIX 

La  politique  de  prix  est  lide  b  la  politique  de  produits.  Il  faut  cesser  de 
subventionner  les  produits  dont  le  coOt  est  de  plus  en  plus  dlevd  et  notamment  les 
publications  sur  papier**. 

Il  faut  aussi  faire  perdre  l'idde  que  1  1  information  peut  6tre  gratuite,  et 
cesser  d'offrir  des  services  gratuits,  en  faussant  ainsi  tous  les  exercices  de  comptabi- 
litd . 

Il  faut  tirer  parti  des  nouvelles  technologies  pour  abaisser  les  coOts.  Il 
faut  aussi  developper  une  politique  active  de  sous-produits .  L'effort  se  porte  actuelle- 
ment  sur  la  recherche  bibliographique  en  ligne  mais  il  exists  bien  d'autres  applications 
qui  peuvent  tirer  parti  du  fonds  constitud  :  recherches  bibliomdtriques ,  veille  scienti- 
fique  et  observations  statistiques  ou  analyses  de  toutes  sortes,  facilitds  par  la  mini- 
informbtique . 

Ce  qui  importe  dans  cette  politique  de  prix,  c’est  que  soient  toujours  incluses 
dans  le  prix  les  redevances  b  verser  aux  auteurs  et  dditeurs,  et  que  la  marge  de  profit 
reste  suffisante  pour  dviter  le  recours  b  la  subvention.  Ceci  est  possible  compte  tenu 
de  1  '  d largissement  prdvisible  du  marchd,  des  regroupements  qui  vont  s'effectuer,  de 
1  '  abaissement  des  prix  des  dquipements  et  des  mdmoires,  de  1 ' accdldration  des  ddbits  des 
rdseaux.  Dans  la  compdtition,  les  meilleurs  doivent  survivre,  et  il  n'est  pas  ndeessaire 
qu'ils  soient  trbs  nombreux.  Les  serveurs  ue  sont  d'aill^urs  pas  inquiets  lorsqu'ils 
analysent  la  situation  (12). 

Enfin  il  faut  que  les  gouveriements  ne  viennent  pas  faire  une  concurrence  au 
secteur  privd  de  fagon  abusive,  e'est-b-dire  hors  des  cas  oil  il  intervient  pour  des 
raisons  d'dquitd  pour  rdtablir  un  dquilibre. 

Le  sujet  est  vaste,  et  mdriterait  bien  d'autres  ddveloppements ,  puisque  tout 
montre  qu'on  dvoluera  vers  une  dconomie  in  forma tionnel le ,  suivant  l'analyse  proposde 
par  ESKL ,  LION  et  P0G0REL  (13)  dont  je  cite  un  court  extrait  "Les  grandes  manoeuvres... 
autour  des  nouveaux  medias  mettent  en  Evidence  la  recherche  de  nouveaux  produits  et  de 
nouveaux  moddles  de  consummation  susceptibles  de  prendre  le  relais  du  modble  logement 
-  dlectro-mdnager  -  automobile,  qui  avait  dtd  le  moteur  de  croissance  des  anndes 
IP- 15-1973.  L ' information  apparalt  ainsi  comme  le  champ  nouveau  de  la  croissance  au 
moment  oCj  les  socidtds  industriel  les  se  heurtent  aux  limites  posdes  par  la  matdrialitd 
des  produits". 


L'Europe  face  aux  technologies  de  1 ' information.  Le  Dossier  de  l’Europe.  E  1980  - 
3  mars  1980.  Commission  des  Communautds  Europdennes. 

Lea  principaux  dditeurs  se  sont  d'ailleurs  ddjb  rapprochds  du  secteur  informatique, 
ou  bureautique  ou  tdldmatique,  qui  reprdsente  leur  seui  avenir,  si  l'on  en  croit 
Lancaster  (3). 
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4.  FACTEURS  JURIDIQUES 

Le  problAme  majeur  reate  celui  du  copyright,  maia  d'autrea  problAmes  se  pose- 
run  t,  qui  influenceront  lea  ayatfemes  d 1  information  :  flux  tranaf ronti&res ,  responsabi 1 i tA 
du  fournirseur  de  donnAes. 


COPYRIGHT 

Si  1 ' in  formation  eat  encore  trop  rarement  offerte  aur  le  marchA  c'eat  parce 
que,  comme  le  aouligne  MADEC  (14),  faute  d'une  protection  auffiaante  de  la  propriAtA 
inte I lectuelle ,  la  retention  eat  considArAe  comme  la  meilleure  garantie  contre  le  "pira- 
tage" . 


Contre  la  retention,  il  y  a  le  "Freedom  of  Information  Act",  maia  un  veritable 
droit  de  1 ' information  eat  A  mettre  en  place  pour  couvrir  toua  lea  aapects  juridiquea 
posAs  notamment  par  l'Avolution  de  la  technologie  qui  permet  toua  les  abua,  toutea  lea 
manipulations  et  rend  difficile  le  controls  des  flux  de  donnAes.  II  eat  possible  qu'en 
cette  matiAre,  l'ordre  international  prAcfede  l'ordre  interne  et  lui  offre  un  canevaa 
uni  forme . . . 


En  l'abaence  de  cette  legislation  internationale ,  il  convient  d'adopter  une 
attitude  prudente  et,  en  tout  Atat  de  cause,  d'appliquer  la  legislation  aur  la  propriete 
intel lectuelle .  Cette  recommandation  eat  importante  si  l'on  veut  qu'un  certain  laxiame 
qui  s'instaure  dAjb  ne  conduise  pas  A  des  conflits.  Par  example  Holmes  (15)  rapporte 
les  propos  de  Martin  HOWE,  A  l'Institute  of  Information  Scientists  le  29  septembre  1980, 
faiaant  valoir  que  "le  copyright  se  trouve  acquis  A  l'usager  (vested  with  the  user)  qui 
a  pris  possession  des  donnAes  en  vertu  de  la  formulation  complexe  qu'il  a  eu  A  utiiiser 
pour  lea  trouver"...  ce  qui  est  fascinant  selon  l'expreasion  de  Holmes.  11  aerait  urgent 
de  trouver  dea  solutions  de  compromis,  des  codes  de  dAontologie  qui  pourraient  Atre 
introduits  dans  les  contrats  ou  protocoles  entre  usagers  et  serveurs,  serveurs  et  produc- 
teurs  de  banques  de  donnAes,  producteura  de  banques  de  donnAes  et  Aditeura  ou  auteurs, 
a'il  s'agit  de  personnes  morales  diffArentea.  Toute  une  rAglementation  se  dAveloppera 
au  niveau  des  protocoles  d'accAa. 

FLUX  TRANSFRONT IERES 

La  confArence  de  Juin  80  tenue  A  Rome  aur  ce  aujet  faisait  suite  A  la  coherence 
SPIN  qui  recommandait  l'examen  t 

-  dea  initiatives  nationales, 

-  des  programmes  internationaux  de  recherche  su:  ’  ■.»  flux  <*  ^onnAea  et 
1  '..armoniaation  des  r  Aglemen  tat  ions  national'.*  '  1). 

Les  systAmes  juridiquea  sont  fondAs  sur  la  notion  de  territorialitA.  Les  pays 
exergant  lour  souverainetA  en  rAglementant  A  1'intArieur  de  leurs  frontiAres  les  activitAs 
tangibles  ou  visibles,  mais  1 ' information,  transformAe  en  unitAs  binaires  invisibles 
transmiaea  sur  de  grandes  distances  va  obliger  A  modifier  les  systAmes  juridiques  natio- 
naux.  Certains  Atats  risquent  de  dresser  des  "barriJres  invisibles  aux  Achanges  invisi¬ 
bles"  parce  que  la  libre  circulation  des  donnAes  dApasse  de  trAs  loin  la  notion  de  libre 
Achange.  Elle  ouvre  la  voie,  sans  posaibilitA  de  contrOle,  A  la  fraude,  A  1 ' espionnage, 
au  dumping  ou  A  l'Avaaion  de  bAnAfices,  peut  entrainer  des  dAsAquilibrea,  une  vulnArabi- 
litA,  une  altAration  de  1'identitA  culturelle.  Toutea  ces  idAes  sont  dAveloppAes  par 
Madec  (14).  Ce  contrOle  ou  cette  circulation  devrait  s'exercer  aans  s'opposer  aux  cinq 
principes  fondamentaux  dont  s'inapirent  les  Etats-Unis  :  1)  Libre  circulation  de  l'infor- 
mation.  2)  Protection  de  la  vie  privAe.  3)  Libre  jeu  des  mAcanismes  du  marchA.  4)  Libre 
Achange.  5)  DiaponibilitA  d ' installations  et  de  services  de  tAlAcommunicationa  appro- 
priAs  (17). 

RESPONSABILITES  EN  MATIERE  DE  FOURNITURE  DE  PONNEES 

La  question  de  aavoir  dans  quelle  mesure  le  producteur  de  la  banque  de  donnAes 
est  reaponsable  de  la  qualitA  et  de  la  fiabilitA  des  donnAes  fournies  s'est  dAJA  posAe 
et  se  posera  A  nouveau.  Le  aerveur  n'eat  certainement  pas  en  cause.  Les  limites  de  res- 
ponsabilitA  du  producteur  devront  Atre  dAfinies  dans  le  cadre  d'une  recherche  qui  abou- 
tira  a  une  codification  et  A  un  droit  de  1 ’ information  apportsnt  aur  ce  point  auasi  la 
clartA  souhaitable. 


5.  FACTEURS  HUMAINS 

En  France  on  parle  de  la  loi  du  moindre  effort,  et  non  pas  de  la  loi  de  M00ERS 
pour  expliquer  qu'un  ayatAme  d'information  tendra  A  ne  pas  Atre  utilijA  lor. qu'il  sera 
plus  pAnible  de  l'utilieer  que  de  a'en  passer.  Ouoiqu'il  en  soit  cette  vArite  est  easen- 
tielle.  Il  faut  done  en  tenir  compte  si  l'on  veut  que  le  marchA  de  1 ' information  se 
dAveloppe  et  attire  1  'utiliasteur  final. 


On  a  dej4  bien  etudie  les  systfemes  interactifs  et  le  comportement  de  l'utili- 
sateur  d'un  terminal.  II  est  mSme  regrettable  que  cet  acquis  n’ait  pas  ete  utilise  pour 
les  systfemes  VIDEOTEX  qui  ont  refait  leur  apprentissage  des  mfimes  erreurs. 

L'ergonomie  du  systfeme  est  essentielle.  L'analyse  conduit  6  isoler  des  dfefauts 
de  taille,  qui  souvent  peuvent  Stre  facilement  corrigds  :  effet  de  pression,  effet  de 
vitrine,  difficultfes  diverses  de  lecture  de  l'fecran,  d '  ut i 1 isa t ion  du  clavier,  phobies 
ou  allergies  possibles... 

L'ingfenieur  est  aujourd'hui  gdndralement  angoissfe  &  l'idfee  d'utiliser  un  cla¬ 
vier  et  de  converser  avec  un  ordinateur.  Mais  il  faut  bien  dire  que  rien  n'est  fait  pour 
le  mettre  fe  l'aise  (messages  abrupts  du  genre  "invalid  statement",  "erreur  de  syntaxe". 

L ' utilisateur  est  finalement  absorbd  par  la  "mdcanique  de  1  ’  interrogation"  au  point  qu'il 
en  oublie  son  sujet.  Les  ddlais  entre  les  messages,  les  repetitions  de  procedures... 
tout  cela  peut  entrainer  la  fatigue,  1 'ennui  et  la  frustration  definitive.  Les  anodes  BO 
devraient  permettre  d'arriver  4  une  amelioration  sensible  de  la  jonglerie  demandde  a 
1 'utilisateur  qui  devrait  pouvoir  explorer  un  fonds  documentaire  et  etre  satisfait  de 
1 'ef f icaci td ,  au  regard  de  l'effort  accompli.  L'accfea  au  document  primaire  restera  le 
point  noir  tant  que  chaque  serveur  n'aura  pas  introduit  la  commande  au  terminal,  et  que 
chaque  pays  n'aura  pas  organise  l'accfes  au  document  primaire. 

L ' ut i 1 isa teur  voudra  aller  plus  loin  et  constituer  lui-meme  sa  documentation 
en  "siphonant"  1 ' information  issue  de  diverses  sources.  Des  experiences  en  cours  telles 
que  SHOEBOX  et  AUTONOTE  (18)  permettent  ddjd  d'dtudier  les  comportement3  et  d'en  ddduire 
des  solutions  approprides  offertes  pour  amdliorer  la  communication  entre  l'homme  et  la 
source  d ’ in  forma tion . 

L'objectif  n'est  en  effet  pas  de  plier  1 '  utilisateur  au  systfeme  construit  sans 
lui,  ni  de  le  contraindre  &  utiliser  un  dquipement  impose  qui  perturbe  son  environnement . 
L'objectif  est  un  systdme  construit  avec  1 'uti lisa teur ,  et  dvoluant  avec  lui  et  pour 
lui... 

BARR1ERES  LINGUIST IQUES 

II  faudrait  r'evenir  4  l'UNISIST  qui  souhaitait,  dans  la  recommandation  4  du 
Groupe  I  (Outils  de  1 ' intercommunication  entre  systfemes)  que  soient  dfeveloppfes  des  outils 
qui  servent  au  contrdle  et  4  la  conversion  des  langue3  naturelles  dans  la  science  et 
la  technologie  :  normaliser  les  nomenclatures  scientif iques  sur  une  base  internationale , 
mettre  au  point  des  vocabulaires  canoniques  ou  thesaurus  destines  aux  systfemes  d'infor- 
mation  et  faire  progresser  l'indexation  automatique  des  documents. 

Au  lieu  de  s'en  tenir  4  ces  objectifs  prudents  permettant  le  balayage  de  la 
littdrature  secondaire  et  1 ' ident i f icat ion  de  son  contenu,  on  a  prdfdrd  poursuivre  la 
chimfere  de  la  traduction  automatique  des  documents  primaires,  avec  des  rfesultats 
variables,  bons  ou  mauvais  selon  qu'on  y  croit  ou  qu'on  n'y  croit  pas  (il  y  a  un  lien 
entre  la  traduction  automatique  et  la  mfedecine  non  orthodoxe). 

L'objectif  limite  que  proposait  l'UNISIST  avait  l'avantage  de  pouvoir  "accro- 
cher"  chaque  langue  nouvelle  4  un  systfeme  relativement  simple  de  normalisation  des 
concepts  sans  entrer  dans  l'analyse  de  chaque  langue  (TITUS  est  un  excellent  exemple 
de  cette  approche  prudente). 

Je  continue  de  penser  que,  malgrfe  les  progrfes  indfeniables  realises  dans  la 
traduction  automatique,  l'objectif  limite  de  l'UNISIST  permettrait  ddjfe  une  certaine 
intercommunication  de3  systfemes,  suffisante  pour  se  tenir  au  courant  de  la  production 
de  chaque  groupe  linguistique ,  et  decider  alors  de  traduire  dventuel lement  tel  ou  tel 
document  primaire...  En  matifere  de  traduction  automatique,  le  premier  objectif  devrait 
etre  de  balayer  la  littdrature  secondaire,  en  rAglant  par  14  aussi  le  problfeme  des 
coflts  dlevd3  de  saisie  puisque  1 ' inf ormation  dans  ce  cas  existe  dfejfe  sur  support 
lisible  par  calculateur. 


CONCLUSION 


J'ai  essayfe  d'fevoquer  les  facteurs  qui  influenceront  les  systfemes  d '  information 
de  ia  dfecennie  dans  laquelle  nous  venons  d'entrer. 

L'enjeu  est  considerable  et  le  marche  est  en  pleine  expansion. 

Il  nous  appartient  d'agir  des  4  present  pour  que  ces  facteurs  aient  des 
influences  favorables. 

Trop  d'aspects  ont  ete  fevoqufes  ici  en  trop  peu  de  temps.  Oes  analyses  beaucoup 
plus  approfondies  ont  dfejfe  ete  faites,  notamment  par  les  auteurs  que  j'ai  cites. 
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Si,  aprfes  cet  expose,  on  me  demandait  de  rdsumer  en  quelques  mots  les  points 
sur  lesquels  l'effort  principal  doit  porter,  je  repondrais  sans  hesitation  s 

1)  d'une  part  faciliter  l'accbs  aux  banques  de  donndes  en  simplifiant  les 
procedures,  en  introduisant  le  lanqage  naturel  avec  des  logiciels  beaucoup  plus  perfor- 
mants  que  ceux  qui  sont  rdpandus  aujourd'hui  et  en  a 1 interessant  &  d'autres  families 
linguistiques . 

2)  D'autre  part,  montrer  aux  ingenieurs  et  techniciens  que  1 ' inf ormation  leur 
est  indispensable  et  que  l'accfts  S  1 ' information  est  facile,  et  entreprendre  une  vaste 
campagne  de  sensibilisation  et  de  formation,  notamment  au  niveau  des  Ecoles  et  de  l'Uni- 
versite,  pour  que,  dans  la  competition  internationale  ddje  amorcde,  les  richeases  aujour¬ 
d'hui  accessibles  ne  restent  pas  ignordes  ou  sous-u t i 1 isdes ,  avec  les  perils  rue  cela 
implique.  II  ne  s'agit  pas  seulement  de  formation  mais  d'une  education  devant  conduire 

b  une  rdelle  prise  de  conscience  de  l'enjeu  international,  et  conduisant  6  transformer 
de  fagon  radicale  le  comportement  des  scientif iques  des  annees  80  via-i-vis  de  1'infor- 
ma t ion . 

Si  ces  objectifs  sont  poursuivis  et  atteints,  le  marche  prendra  soin  de 
lui-mSme  et  la  promotion  des  systfemes  ne  sera  plus  qu'une  affaire  d'emulation  entre 
concurrents  visant  la  meilleure  satisfaction  des  besoins. 
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(This  English-text  version  of  the  Paper  presented  by  M.  Yanez  has  been  produoed 
in  response  to  requests  made  by  a  number  of  persons  who  attended  the  Munich 
Symposium .  It  is  regretted  that  time  did  not  allow  Englieh-text  versions  of 
the  illustrations  to  be  produced.  Accordingly ,  readers  are  referred  to  the  French- 
text  figures  which  appear  on  the  immediately  preceding  pages.) 


SUMMARY 


A  look  backwards  to  the  60' s  and  a  reminder  of  the  evolution  which  led  to  the  so-called 
information  era  will  help  in  setting  the  general  framework.  The  Information  market  of 
the  80' s  will  obviously  be  buoyant  if  we  are  able  to  go  on  building  good  information 
systems  and  if  we  act  accordingly  on  the  organizational,  political,  economic,  legal 
and  human  factors  involved.  These  themes  are  not  dealt  with  exhaustively,  but  the 
purpose  is  to  highlight  those  actions  which  are  felt  to  be  essential.  A  number  of 
pitfalls  will  be  avoided  if  we  do  not  put  off  the  sound  preparation  of  the  systems  of 
the  80* s. 


GENERAL  FRAMEWORK  OF  THE  EVOLUTION 

Can  you  remember  the  60' s?  In  those  days,  the  information  system  was  looked  upon  as  a 
self-contained  entity  (Fig.l):  a  "user  population"  and  a  "corpus  of  abstracts"  supposed 
to  cover  the  needs  of  the  population.  The  accent  was  put  on  recall  and  precision,  with 
the  Cranfield  Tests,  endeared  to  Cleverdon,  so  thoroughly  discussed  at  the  1965  NATO 
Advanced  Study  Course  In  The  Hague,  The  Netherlands  -  a  meeting  dealing  with  the  evaluation 
of  information  systems. 

The  information  systems  of  the  80' s  are  going  to  be  those  heralded  by  the  Nora-Minc 
report  (1)  from  which  this  short  extract  is  taken:  "mass  informatics  will  permeate 
tomorrow's  society  just  like  electricity  ....  Unlike  electricity,  telematics  will  not 
carry  an  inert  current  but  information,  i.e.,  power.  It  will  transform  the  behaviour  of 
organizations  and  of  society  as  a  whole  ..." 

We  could  just  as  well  quote  Porat's  thesis  (2)  or  Machlup  and  OECD  reports  showing  that 
the  growth  rate  of  the  information  sector  is  and  will  remain  higher  than  that  of  other 
sectors,  whether  we  look  at  it  in  terms  of  employment  or  of  added  value  (Fig. 2) . 

The  systems  of  the  90' s,  which  we  are  going  to  prepare  in  the  80's,  will  presumably  be 
those  envisioned  by  Lancaster  (3)  (Fig. 3)  who  uses  a  sou, id  Delphi  Survey  to  predict  that 
the  scientist  will  have  a  terminal  at  home,  a  terminal  in  nis  office  or  lab  ...  and  a 
terminal  he  will  carry  out  on  his  journeys,  in  order  to  keep  in  touch  with  his  notebooks, 
his  mailbox,  publishers,  and  data  banks  through  an  electronic  link. 

In  the  meantime,  the  systems  of  the  80's  will  be  transitional,  and  of  the  type  illustrated 
in  Fig. 4,  which  shows  that  brokers  or  other  information  professionals  are  going  to 
interface  between  the  user  and  information  sources,  until  the  time  comes  when  the  end- 
user  has  learned  how  to  go  it  alone  and  to  use  the  resources  which  are  so  profusely 
(and  yet,  up  to  now,  awkwardly)  offered  to  him.  That's  why  we  should  be  prepared  to  act 
on  the  various  factors  which  are  going  to  Influence  this  evolution:  it  is  a  wide  ranging 
programme,  and  we'll  tackle  some  of  its  most  obvious  facets. 


1.  ORGANIZATIONAL  FACTORS 


UNISIST 

First  and  foremost,  we  should  not  lose  sight  of  the  UNISIST  programme  (4) .  The  world 
system  of  scientific  information,  as  designed  and  published  by  UNESCO  in  1971,  lists 
a  number  of  objectives.  The  principles  and  guidelines  put  forward  in  '71  are  in  no  way 
obsolete  10  years  later,  in  spite  of  technological  changes,  which  is  rather  reassuring. 
The  NATIS*  concept  makes  it  possible  to  find  in  every  nation  the  counterpart  formally 
entitled  to  influencing  organizational  factors,  in  a  coordinated  framework,  and  in  the 
light  of  specific  requirements. 


*  National  Information  System 
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MANAGEMENT  AWARENESS  IN  VIEW  OF  THE  NEEDED  REORGANIZATION 

The  national  focal  point,  such  as  MIDIST*  in  France ,  will  soon  be  neutralized  if  its 
efforts  give  rise  to  nothing  but  indifference,  or  cannot  permeate  the  management 
stratum.  Auerbach  and  Slamecka  (5)  stress  that  in  the  United  States  lack  of  management 
awareness  remains  a  problem,  so  that  many  firms  and  public  services  are  unable  to  assess 
the  full  meaning  and  value  of  information  and  the  potential  of  information  systems  for 
the  firm's  strategy  and  the  monitoring  of  international  trade.  Thus,  we  should  alert 
managers  through  a  programme  aimed  at  them:  it  could  be  in  the  form  of  seminars,  visits, 
lectures,  exhibitions  and  demonstrations,  and  some  sort  of  training  on  the  subject  of 
information,  its  organization,  and  the  proper  way  to  use  it.  This  will  lead  to  a 
reorganization  of  the  firms,  which  will  take  care  of  information  at  every  level  and  will 
tend  to  build  a  hierarchical  network  of  information  tailored  to  the  organization. 


SYSTEM 

We  should  also  remind  management  of  the  fact  that  to  direct,  steer,  or  govern  is  to 
step  into  cybernetics,  which  has  been  defined  as  the  science  of  information,  or  the 
art  of  making  action  efficient.  The  system  is  healthy  if  the  proper  feedback  is  obtained. 
This  is  more  and  more  important,  as  cybernetics  will  go  on  moving  towards  what  had 
previously  been  human  territory,  and  this  should  not  frighten  us.  We  can  obviously 
use  Aurel  David's  (6)  representation  M  of  the  technical  factors,  or  the  tool,  or  the 


machine  moving  towards  A  (animus  or  spirit  or  non-technical  factors)  to  take  care  of 
new  steering  functions  ...  without  any  danger  for  A  which  is  the  point  where  two  parallels 
intersect.  A  will  no  doubt  maintain  its  transcendant  value  as  opposed  to  the  tool. 

In  the  '80's,  we  must  be  able  to  analyze  the  system,  and  to  cut  it  into  as  many  parts  as 
possible,  as  suggested  by  Descartes,  but  we  must  not  lose  sight  of  the  fact  that  the 
whole  is  superior  to  the  sum  of  the  parts,  as  emphasized  by  Aristotle  or  Plato.  I 
would  suggest  that  a  definition  of  the  system  and  its  fundamental  implications  be 
engraved  wherever  possible:  "a  homogeneously  designed  ensemble  to  which  a  specific 
mission  is  given,  the  possibilities  of  which  are  fully  known  by  the  user,  who  is  well 
aware  of  its  optimal  use,  from  which  the  best  result  is  looked  for,  at  any  point  in  time, 
through  a  succession  of  choices  and  trade-offs  inspired  by  cost-efficiency  relationships, 
with  due  consideration  of  all  existing  constraints". 


DECENTRALIZATION 

The  requirement  of  a  global  vision  does  not  imply  centralization.  Organizations  should 
get  rid  of  their  sclerosis  and  the  fossilized  condition  which  derives  from  their  "Mythical 
Quest  for  the  Grand  System  of  Integrated  Management" ,  without  seeing  that  complex  and 
rigid  softwares  can  only  be  still-born,  in  view  of  the  unending  redeployment  of  human 
organizations  in  their  moving  environment.  Alain  Madec  (7)  makes  references  to  this 
backward- looking  engineer  who  was  eager  to  thwart  the  normal  trend  of  data-processing 
towards  a  decentralization  of  functions,  and  was  heard  saying  "If  God  had  wished 
intelligence  to  be  distributed,  he  would  have  put  brains  in  our  fists  ..."  However, 
in  the  80' 8,  the  decentralization  and  use  of  intelligent  terminals,  minicomputers  and 
distributed  data  processing  will  have  to  be  enforced. 


SEPARATION  OF  FUNCTIONS 

As  opposed  to  the  systems  of  the  60's,  we  are  already  experiencing  a  move  towards  the 
differentiation  of  functions,  just  as  occurred  in  the  animal  kingdom: 

-  A  design  function  (including  users'  needs,  marketing,  long-term  studies) 

-  A  primary  document  producer  function 

-  A  data  base  (textual  or  factual)  producer  function 

-  A  host  computer  (or  vendor)  function 

-  A  broker  function  (inventory  of  sources  -  best  access  to  the  appropriate  sources  - 

training  of  users) 

-  An  education  function 

-  A  steering  function  (management  based  on  systems  analysis  and  use  of  feedback) . 


Design  Function 

It  will  remain  fundamental  to  find  one's  proper  place  in  relation  to  potential  users, 
to  have  a  marketing  policy  dictated  by  the  market  study,  and  to  retain  customers  through 
efficiency  as  well  as  by  maintaining  an  edge  over  competitors. 
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Primary  Document  Producer  Function 

Authors  and  publishers  must  be  rewarded  and  strong  Incentives  must  be  found  to  enable 
this  to  be  done.  There  is  no  need  to  create  a  new  structure  for  this  function.  On 
the  contrary,  there  is  a  strong  case  for  using  and  reinforcing  the  existing  structures: 
technical  centres.  Engineering  and  Scientific  Societies  and  their  federations,  University 
laboratories  and  their  scientists,  these  are  the  bodies  which  normally  produce  information, 
if  they  are  aided,  encouraged,  oriented  and  if  their  material  is  collected.  The 
production  of  primary  documents  is  the  item  which  should  rank  first  in  every  government's 
efforts  since  obviously  the  contents  should  be  there  if  a  container  is  to  be  provided. 

The  documents  will  be  of  high  or  mediocre  quality  according  to  whether  the  scientist 
is  invited,  encouraged  and  helped,  in  the  appropriate  framework,  to  produce  theses, 
reports,  articles,  books,  patents,  papers  to  be  discussed  in  Symposia  ...  or  he  is  just 
ignored  and  ends  up  withdrawn  into  his  shell. 

An  active  participation  in  and  membership  of  one  scientific  society  at  least  should  be 
a  must  for  any  engineer  or  scientist.  Such  is  the  case  in  the  United  States  and  a  lot 
of  pondering  is  needed  in  Europe  over  the  achievements  of  such  Societies  as  the  Institute 
of  Electrical  and  Electronics  Engineers,  the  American  Chemical  Society,  the  American 
Society  for  Metals  ...  the  Engineers  Joint  Council,  whose  primary  and  secondary 
publications  are  organized  and  redistributed  or  sold  throughout  the  world. 


The  Database  Producer  Function 

When  you  decide  to  enter  the  market  of  secondary  information,  you  will  have  to  define 
your  sector,  look  for  associates  in  a  cooperative  venture,  provide  full  coverage  of  the 
sector,  standardize  your  formats,  look  for  by-products.  There  again.  Technical  Centres 
and  Engineering  Societies  will  probably  be  sound  anchor-points  for  efficient  and  durable 
action.  The  aim  is  to  have  files  or  data  banks  produced  by  representatives  of  the 
potential  users,  i.e.,  people  who  are  most  likely  to  be  knowledgeable  as  to  the 
orientation  of  the  contents  and  the  training  of  users. 


Vendor  or  Host  Computer  Function 

Textual  data  banks  thus  produced  should  be  placed  on  a  restricted  number  of  hosts.  The 
user  will  be  delighted  to  find  information  in  one  place  instead  of  having  to  shop  around 
the  various  files  of  a  number  of  host  computers.  Synergy  is  fundamental. 

Conversely,  factual  data  banks  are  best  placed  at  the  location  where  they  were  set  up 
and  are  updated  since,  in  most  cases,  problems  raised  by  the  users  will  require  analysis 
by  specialists  attached  to  the  data  bank,  and  it  is  they  who  are  best  able  to  choose  the 
right  search  strategies  and  comment  upon  the  results  in  a  satisfactory  way. 


The  Broker  Function 

It  is  unlikely  that  in  the  80 's  users  will  be  able  to  solve  by  themselves  all  their 
information  problems.  In  this  transitional  period,  new  brokers  are  even  going  to  emerge 
and  grow.  By  brokers  we  mean  anybody  who  interacts  between  the  end-user  and  the 
information  sources.  It  may  be  a  service  offered  inside  the  organization,  by  someone 
who  provides  a  link  with  the  available  sources  and  is  fully  knowledgeable  on  the  command 
languages ,. the  contents  of  files,  protocols  and  specific  difficulties  of  access.  Or  it 
may  be  some  agency  outside  the  organization  that  is  supposed  to  produce  the  answer  once 
the  problem  has  been  thoroughly  discussed  with  its  staff  of  analysts  (such  as  NERAC*) . 

Until  the  end-user  is  fully  grown  up  and  educated,  there  will  be  thriving  days  for 
brokers . 


The  Education  Function 

This  function  is  too  important  to  be  left  only  to  the  care  of  data-base  producers,  hosts 
and  brokers.  It  will  have  to  be  grasped  by  governments  and  introduced  into  the  syllabus 
of  schools  and  universities. 


The  Steering  Function 

Again,  it  is  the  users'  feedback  which  will  lead  to  the  decision  either  to  buy  this  or 
that  database,  or  simply  to  access  a  host,  directly  or  through  a  broker.  The  trouble  is 
that,  if  the  search  function  is  handed  over  to  a  broker,  the  end-user  runs  the  risk  of 
being  forever  kept  out  of  the  game  and  his  feedback  will  not  be  available  as  it  should 
be,  whereas  if  the  search  function  belongs  to  the  organization,  the  coupling  of  the  end- 
user  with  an  analyst  will  stimulate  and  train  the  user. 

How,  if  and  when  education  programmes  are  provided,  if  command  languages  and  software 
are  improved  and  easier  access  to  the  terminal  is  provided,  and  if  a  new  generation  of 
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engineers  has  been  educated  at  school  and  at  home  through  the  use  of  videotex  sytems  and 
home  computer*,  brokers  will  become  extinct  and  Lancaster's  vision  of  the  future  will 
prevail. 

In  the  framework  of  this  steering  function,  mergings  must  occur,  of  the  sort  we  are 
already  witnessing:  Information  Industry  association,  Data-base  producers  associations, 
and  users'  associations,  and  their  pressure  will  insure  the  proper  balance  between  the 
acting  forces.  Their  feedback  action  will  be  fundamental  for  speeding  up  improvements. 


2 .  POLITICAL  FACTORS 


A  significant  part  of  the  political  action  is  already  suggested  in  the  UNISIST  guidelines 
and  a  policy  is  being  built  at  a  European  level,  which  should  prevent  moves  being  taken 
in  isolation,  and  unnecessary  partitioning.  It  is  the  task  of  every  government  to  assess 
specific  priorities  and  to  act  in  the  direction  of  the  guidelines. 


PUBLIC  VERSUS  PRIVATE  SECTOR 

The  roles  should  be  clear-cut:  governments  have  to  define  objectives  in  accordance  with 
the  available  means,  to  promote  and  subsidize  top  priority  moves,  state  what  the  information 
policy  will  be  and  act  accordingly  upon  structures  and  organizations  in  order  to  bring 
about  an  implementation  of  the  policies  and  develop  research  in  information  science  from 
which  new  objectives  will  be  set  for  the  future. 

The  private  sector  will  be  entrusted  in  essence  with  the  implementation  tasks  and  the 
operation  of  facilities  needed  for  viable  and  efficient  systems,  creating  new  goods  and 
profit.  The  private  sector  is  best  equipped  for  management  except  if  an  overriding 
consideration  is  at  stake  (for  example.  Classified  information).  In  the  United  States, 
this  principle  is  clearly  stated  in  the  A76  Act,  which  provides  that  the  Federal  Government 
will  rely  on  the  private  sector  for  the  publication  and  dissemination  of  scientific  and 
technical  information,  unless  it  is  in  the  national  interest  to  act  otherwlse(8) . 

The  private  sector  tends  to  look  upon  information  as  an  economic  good,  which  can  be 
distributed  according  to  the  market  mechanisms,  just  like  any  other  goods  or  services. 

Governments  maintain  that  an  equitable  access  to  Information  must  be  preserved, 
independently  of  market  laws,  not  only  because  freedom  of  access  to  information  is  a 
citizen's  right  but  because  it  is  essential  for  the  preservation  of  a  free  society. 

We  here  are  falling  back  on  the  Freedom  of  Information  Act  which  made  it  possible  in  the 
United  States  to  displace  the  commercial  frontier  of  the  market  and  the  legal  frontier 
of  the  Classified  data  (17). 

It  is  generally  felt  that  international  bodies  and  governments  are  prone  to  keep  their 
information  activities  to  themselves.  Aitchison  (9)  observes  that,  in  the  United  States, 
the  information  Industry  is  stronger  and  more  diversified  than  in  Europe.  We  could  take 
issue  on  this,  and  point  out  that  the  growth  of  an  information  industry  has  been  a 
peripheral  phenomenon  and  a  spin-off  of  a  sound  and  stubborn  government  action  (COSATI, 

NASA,  NSF,  NTIS  ...).  The  information  industry  appears  as  a  middle-man  organization, 
aiming  at  a  wider  national  or  international  market  for  Information  initially  collected 
for  another  purpose.  Furthermore,  the  information  organization  is  a  carbon  copy  of 
the  research  organization,  and  the  conditions  in  Europe  are  substantially  different  from 
those  in  the  U.S. 

It  is  possible  that  reorganizations  in  the  research  activities  will  entail  a  new 
distribution  of  responsibilities  between  government  and  private  sectors. 


COMPETITION 

It  seems  to  me  that  there  cannot  be  any  competition  between  the  government  and  the 
private  sector,  where  monopolistic  situations  should  be  avoided,  as  well  as  any  friendly 
sharing  of  the  market,  or  any  situation  leading  to  stagnation  and  self-satisfaction.  This 
is  probably  the  reason  why  information  systems  have  shown  few  improvements  in  recent  years: 
a  mediocre  product  has  its  customers ,  so  why  should  we  take  any  risks  by  searching  for 
improvements?  This  is  what  I  gathered  also  from  the  after-dinner  lecture  given  by 
Salton  at  the  1979  Cranfield  meeting.  He  is  somewhat  more  subtle  (with  his  tongue  in  his 
cheek)  when  he  writes:  "Superficially,  it  may  seem  that  little  has  occurred  in 
information  retrieval  in  recent  years.  After  all,  inverted  file  organization  and  Boolean 
query  statements  go  back  to  the  punched  card  era  and  still  prevail  in  most  systems. 

A  closer  look,  however,  reveals  that  on-line  systems  have  changed  the  established 
order..."  and  he  reminds  us  that  new  algorithms  would  make  it  possible  to  go  a  step 
further  (10),  which  is  true  with  SMART,  SPIRIT  and  other  softwares  which  would  make  it 
possible  to  scan  the  literature  and  deliver  a  proper  ranking  of  relevant  documents  at 
the  output,  starting  the  query  in  the  plain  language  of  the  user.  In  other  words, 
competition  in  the  software  field  should  develop.  And  perhaps  we  should  rejoice  over 
the  Federal  Communications  Commission's  (FCC)  statement  that  competition  is  now  open 
in  the  field  of  telematics  between  ATT  (or  the  Bell  System)  and  IBM.  ATT  had  a 
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monopolistic  position  with  regard  to  telephones  but  could  not  up  to  now  penetrate  a 
territory  looked  upon  as  IBM's.  This  is  merely  one  example  of  a  vast  political  and 
organizational  drive  tending  to  a  redistribution  of  forces  in  the  information  sector 
which  has  started  in  the  United  States  as  well  as  in  Europe. 


STANDARDIZATION 

Politics  are  necessarily  involved  in  standardization  activities.  I  would  agree  with  the 
golden  rules  stated  by  Anderla  (11) :  standardization  should  not  be  a  hindrance  to 
creativity,  and  should  essentially  ensure  or  permit  a  certain  stability,  take  good  account 
of  the  opinion  of  all  the  parties  concerned,  and  enable  a  steady  evolution. 


3 .  ECONOMIC  FACTORS 

We  all  know  that  information  is  a  very  special,  intangible  commodity.  Its  value  is 
related  to  its  degree  of  use.  Yet  we  often  value  the  recording  medium  instead  of  the 
information  it  contains.  The  only  criterion  for  assessing  its  value  is  to  relate  it  to 
the  sector  which  produces  or  uses  it.  We  may  ask  ourselves  what  amount  of  money  should 
be  devoted  to  scientific  and  technical  information  in  relation  to  the  amount  devoted  to 
research,  or  to  economic  information  in  relation  to  the  requirements  of  international 
economic  competition,  rather  than  any  assessment  related  to  the  cost  of  processing,  thus 
losing  sight  of  the  fact  that  information  is  much  more  than  paper  or  tape.  It  is  the 
prime  commodity  at  stake  in  present  and  future  international  competition. 

If  people  find  that  information  is  costly,  I  suggest  we  apply  revised  standards  of 
judgement  which  will  value  it  more  and  thus  decrease  its  relative  cost: 

1.  Let  us  make  it  Impossible  to  start  any  'new'  piece  of  scientific  research  without 

a  prior  investigation  of  what  has  already  been  done  and  written  on  the  subject  (something 
like  the  National  Science  Foundation  pressure  exerted  on  private  and  government  research 
organizations  to  urge  them  to  use  SSIE  if  they  want  to  obtain  any  research  grants) . 

Let  us  also  enforce  the  introduction  in  research  contracts  of  a  standard  clause  which 
makes  it  mandatory  to  start  research  only  when  a  bibliographic  survey  has  been  conducted. 

2.  Let  us  create  in  Europe  centres,  such  as  NTIS,  for  the  redistribution  of  information. 
The  SIGLE*  System  is  a  good  example. 

3.  Let  us  avoid  any  misunderstanding  which  could  lead  to  the  sacrifice  of  information 
on  the  altar  of  informatics,  so  that  we  lose  sight  of  the  prime  objective,  namely,  the 
flow  of  information.  The  technical  aspect,  the  equipment,  is  nothing  but  the  container. 
We  are  heading  towards  a  new  world  balance  in  which  information  exchanges  are  going  to 
play  a  central  part  and  where  the  interdependence  of  states  will  increase.  The  existing 
factors  are  on  the  one  hand  the  free  circulation  of  information  (in  the  name  of  liberty 
and  free  trade)  and  on  the  other  the  responsibility  of  sovereign  states  to  seek  a  balance 
based  on  mutually  profitable  exchanges. 


MARKET 

Appleyard  reminds  us  that  there  were  14,000  terminals  in  the  United  States  in  1969.  Today 
there  are  two  million  and  some  ten  million  are  predicted  for  1989.  But  there  are  already 
in  the  United  States  130  million  TV  sets  which  will  become  part  of  the  Interactive 
network  of  tomorrow.  There  will  thus  be  a  considerable  change  of  scale  for  Industry  and 
employment . 

Another  expansion  factor  is  the  fact  that  document  output  is  increasing,  in  non-sclentlflc 
sectors,  and  in  particular  in  the  economic  sector,  in  tune  with  a  general  reliance  of  the 
individual  upon  information,  which  is  thus  spreading  to  all  sectors  of  human  activity. 

The  future  offers  huge  opportunities,  for  data  banks  as  well  as  for  software  and  hardware, 
which  cannot  be  dissociated  since  the  very  concept  of  network  architecture  requires 
consideration  of  the  overall  system.  On  all  these  points,  no  one  political  entity  should 
be  allowed  to  prevail  over  any  other,  as  this  would  be  dangerous.** 


PRICING  POLICY 

Pricing  policy  is  tied  to  products  policy.  It  will  become  necessary  to  stop  subsidizing 
products  whose  costs  are  constantly  escalating.  In  particular,  this  applies  to  hard-copy 
publications. *** 

The  belief  that  information  should  be  free  will  disappear  and  we  should  from  now  on  stop 
offering  products  or  services  free  of  charge,  as  this  practice  makes  nonsense  of  all 
accountancy  exercises. 


System  for  Information  on  Grey  Literature  in  Europe 

L'Europe  face  aux  technolgies  de  1' information.  Le  Dossier  de  l'Europe.  E  1980 
3  March  1980.  The  European  Common  Market  Commission. 

The  principal  publishers  have,  by  the  way,  already  become  involved  in  the  fields  of 
informatics,  telematics,  etc.,  which,  if  Lancaster  (3)  is  to  be  believed,  represent 
their  only  future. 
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A  decrease  in  prices  will  result  from  a  better  use  of  the  new  technologies,  also  from  the 
search  for  more  )  y-products .  Efforts  are  now  centered  on  the  on-line  retrospective 
search,  but  there  are  hosts  of  uses  which  can  be  made  of  a  data  bank,  such  as 
bibliometric  studies,  alert  systems,  statistical  observations  and  analyses  of  all 
sorts,  which  are  made  easier  by  the  use  of  mini  or  microcomputer  systems. 

What  is  important  in  arriving  at  pricing  policies  is  that  full  consideration  is  always 
given  to  authors  and  publishers,  and  that  the  profit  margin  is  such  that  the  need  for 
subsidies  is  avoided. 

This  is  possible,  in  view  of  the  foreseeable  widening  of  the  market  and  of  mergers  which 
are  bound  to  occur,  on  top  of  a  decrease  in  the  prices  of  equipment  and  storage  units, 
and  the  speeding  up  of  data  flow  rates  on  networks.  In  that  competition,  survival  will 
be  for  the  fittest,  and  there  is  no  need  to  have  great  numbers  of  them.  We  should  note 
that  the  hosts  or  information  vendors  are  not  really  anxious  when  they  analyse  the 
present  situation  (12) . 

Last  but  not  least,  government  service  should  not  come  on  the  stage  as  mere  competitors 
of  the  private  sector  in  an  obtrusive  way,  that  is  in  situations  other  than  those  where 
government  action  is  needed  for  equity  or  balancing  purposes. 

The  subject  is  wide-ranging,  and  deserves  ample  discussion,  since  it  is  obvious  that  we 
are  moving  towards  an  information  economy,  according  to  the  thesis  put  forward  by  Eskl, 
Lion  and  Pogorel  (13) ,  from  which  this  extract  is  taken:  "The  great  manoeuvres  which  are 
developing  around  the  new  media  are  evidence  of  a  search  for  new  products  and  new 
consumer  models  likely  to  constitute  the  next  stage  in  the  relay  race,  taking  over  from 
the  'housing/home  appliances/personal  car/model'  which  had  been  the  growth  force  of  the 
1945-1973  era.  Information  thus  appears  as  the  new  growth  territory  at  a  time  when 
industrial  societies  are  being  confronted  with  the  limits  imposed  by  the  materiality 
of  products." 


4 .  LEGAL  FACTORS 

The  main  problem  remains  that  of  copyright,  but  other  problems  will  emerge  and  will 
influence  information  systems:  transfrontier  information  flow,  for  example,  and  the 
responsibilities  of  the  providers  of  data. 


COPYRIGHT 

If  information  is  still  too  scarce  on  the  market,  it  is  because,  as  stressed  by  Madec  (14) , 
the  protection  of  intellectual  property  is  insufficient  and  retention  is  looked  upon  as 
the  best  guarantee  against  piracy. 

Against  retention,  there  is  the  Freedom  of  Information  Act,  but  an  actual  Information  Law 
remains  to  be  drawn  up  to  cover  all  the  legal  aspects  arising,  in  particular,  from  the 
progress  in  technology  which  makes  it  tempting  to  trespass  the  law  when  handling  data, 
and  which  makes  it  difficult  to  monitor  the  movements  of  data.  It  is  likely  that,  in 
this  matter,  international  law  will  act  as  the  harbinger  of  internal  law  and  will  offer 
the  needed  framework. 

In  the  absence  of  international  legal  guidelines,  our  attitude  should  be  one  of  prudence 
and,  for  all  purposes,  follow  the  strict  enforcement  of  the  present  law  on  intellectual 
property.  This  recommendation  is  important  if  we  want  to  avoid  more  of  the  permissive 
practices  which  are  already  in  evidence  and  which  could  well  turn  into  disputes.  For 
example.  Holmes  (15)  refers  to  a  statement  made  by  Martin  Howe  at  the  institute  of 
Information  Scientists  on  29  September  1980,  to  the  effect  that  "copyright  might  be 
vested  with  the  user  who  has  taken  ownership  of  the  data  by  virture  of  the  complex 
search  formulation  he  has  used  to  retrieve  them"  which  is  fascinating,  in  Holmes'  own 
words.  It  would  be  urgent  to  find  trade-offs  and  codes  of  practice  which  could  be 
inserted  in  contracts  or  protocols  between  users  and  vendors ,  vendors  or  hosts  and 
data-base  producers,  data-base  producers  and  publishers  or  authors,  when  the  latter  are 
not  part  of  the  same  corporate  entity.  A  set  of  rules  will  have  to  be  developed  for 
access  to  information  through  agreed  memoranda  of  understanding. 


TRANSFRONTIER  INFORMATION  FLOW 

The  conference  held  in  Rome  in  June  1980  on  this  topic  was  in  the  wake  of  a  SPIN 
conference  which  recommended  an  examination  of: 

-  national  initiatives 

-  national  research  programme  on  data  flows  and  the  harmonization  of  national 

regulations  (16). 

Legal  systems  are  founded  upon  a  concept  of  territoriality.  Countries  exert  their 
sovereignty  by  enacting  laws  within  their  frontiers  on  the  subject  of  tangible  or 
visible  activities,  but  information,  when  transformed  into  invisible  binary  units 
transmitted  over  great  distances,  is  going  to  bring  about  a  modification  of  the 
national  legal  systems.  Some  states  might  raise  invisible  barriers  to  invisible 
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exchanges  because  the  free  transfer  of  data  is  far  removed  from  the  concept  of  free 
trade.  It  opens  the  way  to  fraud,  without  any  possibility  of  checking  it,  to  espionage, 
to  dumping  practices,  or  to  the  evasion  of  profit,  and  car.  thus  bring  about  conditions 
of  unbalance,  vulnerability,  or  an  alternation  in  cultural  identity.  All  these  points 
are  developed  by  Madec  (14) .  This  control  of  transfer  should  be  exerted  without 
encroaching  upon  the  five  fundamental  principles  which  inspire  the  United  States: 

1)  free  circulation  of  information,  2)  protection  of  privacy,  3)  free  action  of  market 
mechanisms,  4)  free-trade,  5)  availability  of  the  appropriate  telecommunication  utilities 
and  services  (17). 


RESPONSIBILITIES  VESTED  WITH  THE  SUPPLIER  OF  DATA 

The  problem  of  the  extent  of  liability  incurred  by  the  data  bank  supplier  as  regards  the 
quality  and  reliability  of  the  data  he  provides  has  already  been  raised  and  is  going  to 
be  discussed  again.  The  host  is  certainly  not  involved.  The  scope  or  limits  of 
responsibility  of  the  data-base  supplier  should  be  the  subject  of  research  leading  to 
a  code  of  practice  as  part  of  an  information  law,  in  order  to  bring  on  this  last  point 
the  desired  clarification. 


5 ■  HUMAN  FACTORS 


In  France,  we  speak  of  the  law  of  least  effort,  and  not  of  Mooers*  Law,  to  explain  that 
an  information  system  will  tend  not  to  be  used  whenever  it  will  require  more  effort  to 
use  it  than  to  do  without  it.  Be  that  as  it  may,  this  truth  is  essential.  We  have  to 
keep  it  in  mind  if  we  want  the  information  market  to  expand  and  to  attract  more  end-users. 

Interactive  systems  and  the  behavioural  aspects  of  the  user  of  these  systems  have  already 
been  exhaustively  studied.  It  is  a  great  pity  that  these  observations  have  not  been  used 
for  the  Videotex  Systems  currently  under  review,  which  have  gone  through  the  same  teething 
troubles  and  errors. 

The  ergonomic  aspects  of  system  use  are  essential.  Sizeable  defects  have  been  found 
through  observation  and  most  of  them  can  be  corrected:  pressure  effect,  fishbowl  effect, 
peephole  effect,  legibility  and  readability  of  the  information  displayed  on  a  screen  or 
on  a  printer  and  a  host  of  defects  of  this  sort  which  give  rise  to  allergies  or  phobias. 

Today's  engineers  are  most  often  panicked  by  the  very  thought  of  using  a  keyboard  and  of 
conversing  with  a  computer.  But  we  must  recognize  that  little  is  being  done  to  make 
them  relax:  stern  messages  appear  such  as  "invalid  statement"  or  "syntax  error". 

The  user  is  eventually  engrossed  by  the  mechanics  of  the  "dialog"  to  the  extent  that  he 
loses  sight  of  his  initial  problem.  The  time  lag  between  messages,  the  repetition  of  arid 
procedures.  All  this  leads  to  weariness,  dissatisfaction  and  eventual  frustration. 

The  80' s  should  bring  a  significant  improvement  in  the  juggling  capabilities  systems 
expect  from  the  end-user,  who  should  be  able  to  scan  a  file  with  satisfaction  as  to  the 
eff iciency-to-ef fort  ratio.  Access  to  the  primary  document  will  remain  a  matter  for 
concern  as  long  as  the  command  for  ordering  documents  is  not  available  at  the  terminal  - 
that  is,  as  long  as  every  country  has  not  organized  proper  access  to  primary  Information. 

The  end-user  is  going  to  be  asking  for  more,  and  will  wish  to  be  able  to  handle  his  own 
documentation  in  his  own  way  by  "siphoning"  on  his  intelligent  terminal  information  from 
various  souces.  Such  experiments  as  Shoebox  and  Autonote  (18)  are  under  way  and  are 
used  to  assess  the  behaviour  of  users  and  infer  solutions  likely  to  improve  communication 
between  the  scientist  and  his  information  sources. 

The  purpose,  in  effect,  is  not  to  make  the  user  accept  the  shape  of  a  system  built  in 
his  absence,  or  to  try,  under  duress,  to  lure  him  into  using  an  equipment  which  disturbs 
his  environment.  The  purpose  is  to  let  him  suggest  what  the  system  should  be  and  how 
it  should  grow. 


LANGUAGE  BARRIERS 

Here  we  should  come  back  to  UNISIST  and  a  wish  expressed  in  guideline  4  of  Group  I 
recommendation  (Tools  for  intercommunication  between  systems) :  Tools  should  be 
developed  for  the  control  and  conversion  of  natural  languages  in  science  and  technology, 
i.e.,  a  standardization  of  scientific  nomenclatures  on  an  international  basis,  the 
development  of  canonic  vocabularies  or  thesauri  used  in  Information  systems,  and  the 
advancement  of  automatic  indexing. 

Instead  of  working  within  these  limited  objectives  aimed  at  scanning  secondary  literature 
in  order  to  identify  its  contents,  the  chimera  of  automatic  translation  was  preferred, 
with  variable  results,  good  or  bad  according  to  whether  you  are  a  believer  or  not  (there 
is  an  affinity  between  automatic  translation  and  unorthodox  medicine) . 

The  limited  objectives  set  by  UNISIST  represent  common  sense,  in  view  of  the  fact  that 
new  languages  could  gradually  have  been  entered  in  to  this  simple  system  (Titus  is  an 
excellent  example  of  this  prudent  approach) . 
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I  am  still  convinced  that,  despite  the  obvious  advance  noticeable  in  automatic  translation, 
the  limited  objectives  set  by  UNISIST  would  permit  a  better  intercommunication  between 
systems  of  different  languages ,  and  would  be  enough  to  get  an  awareness  of  the  production 
of  each  linguistic  group  ....  with  a  view  to  translating  what  is  most  relevant.  I  suggest 
the  first  step  remains  the  scanning  of  secondary  literature,  which  also  solves  the 
problem  of  high  input  costs,  since  the  information  is  available  in  machine-readable  form. 


CONCLUSIONS 

I  have  been  trying  to  point  out  what  factors  will  be  influential  in  the  decade  we  have 
just  entered. 

The  stake  is  tremendous  and  the  market  will  go  on  expanding. 

It  is  our  duty  to  act  now  in  order  to  make  sure  that  these  factors  will  have  favourable 
influences. 

Too  many  facets  have  been  raised  here  in  a  short  time. 

Further  consideration  of  these  problems  is  needed,  starting  in  particular  from  UNISIST 
and  the  authors  I  mentioned. 

If  you  were  to  ask  me  what  are,  in  a  few  words,  the  facets  which  require  most  attention, 

I  would  unquestionably  answer: 

1.  first,  ease  access  to  data  banks  by  simplifying  procedures  and  by  using  natural 
language  with  more  efficient  softwares  than  today  ...  and  be  more  attentive  to  other 
languages . 

2.  Remind  scientists  and  engineers  of  the  fact  that  information  is  their  normal  food, 
and  undertake  a  campaign  aimed  at  schools  and  universities  in  order  to  make  sure  that, 
in  the  current  international  competition,  no  information  resources  are  ignored  or  under¬ 
used.  This  is  more  than  a  matter  of  training.  It  is  a  real  education  which  should  lead 
to  the  full  awareness  of  the  stake  involved,  and  a  resulting  transformation  of  the 
behaviour  of  the  scientists  of  the  80' s  with  regard  to  information. 

If  these  objectives  are  pursued  and  attained,  the  market  will  no  doubt  take  care  of  itself 
and  the  marketing  of  information  systems  will  simply  be  a  matter  of  emulation  between 
competitors  eager  to  work  for  a  better  satisfaction  of  their  users. 
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In  broad  terms,  automated  indexing  for  document  content  analysis  is  a  method  intended 
to  supersede  or  supplement  the  knowledge  and  experience  of  the  human  analyst.  It 
requires: 

-  the  construction  of  programmable  rules  for  the  analysis  of  texts,  and 

-  the  development  and  application  of  a  relational  dictionary  connecting  the 
words  in  the  texts  to  the  controlled  indexing  vocabularies. 

The  unresolved  difficulties  of  constructing  such  large  dictionaries  are  the  main  reasons 
why  automated  indexing  has  not  as  yet  emerged  from  the  laboratory. 

Free-text  search  has  been  developed  for  application  to  documents  not  intellectually 
analyzed  and  as  a  supplement  to  search  based  upon  intellectually  indexed  documents. 

Both  the  increasing  costs  and  inconsistent  quality  of  intellectual  indexing  and  the 
unsatisfactory  situation  with  regard  to  automated  indexing  has  led  to  the  increasing 
application  of  free-text  search.  However,  the  association  between  an  initial  query 
formulation  and  its  free-text  alternative  terms  is  just  the  opposite  to  that  of  the 
indexing  vocabulary  of  documents.  Both  methods  -  automated  indexing  and  assistance 
to  free-text  search  -  are  involved  with  similar  problems:  to  substitute  or  reduce 
intellectual  work  using  stored  information  in  forms  of  relationships  between  words. 

Both  techniques  can  be  regarded  as  complementary  and  aimed  towards  achieving  Improved 
retrieval. 

Introduction 


About  ten  years  ago,  G  Salton  (II,  along  with  other  experts,  expected  that  automated 
indexing  would  be  routinely  applied  in  the  near  future.  As  is  now  obvious,  this 
expectation  proved  overly  optimistic.  The  questions  whether  computerized  content 
analysis  is  practically  feasible  for  general  applications  and  when  a  proven  system 
for  content  analysis  will  be  achieved  remain  very  much  open. 

Today  I  should  like  to  summarize  what  the  problems  of  intellectual  and  automated  indexing 
are.  In  addition,  I  intend  to  discuss  why  the  expected  development  of  computerized 
content  analysis  was  delayed  and  what  has  to  be  done  to  transform  existing  experimental 
systems  scattered  throughout  the  world  into  standardized  tools  applicable  to  information 
retrieval.  Details  concerning  the  different  approaches  and  problems  in  automated 
indexing  are  omitted;  they  are  discussed  elsewhere  (e.g.  (2),  (3)). 

Any  description  of  the  state  of  automated  indexing  must  include  a  discussion  of  the 
free-text  method  for  Information  retrieval.  I  hope  to  make  clear  that  both  methods  - 
automated  indexing  and  free-text  retrieval  -  are  essentially  related  as  to  their 
problems  and  methods. 

The  Concept  of  Content  Analysis 

The  analysis  of  documents  for  the  purpose  of  information  retrieval  includes  two  main 
steps: 

1.  the  determination  of  what  a  document  is  about; 

2.  the  translation  of  this  conceptual  analysis  into  an  appropriate  documentation 
language,  independently  of  whether  a  classification  system  is  used  or  controlled 
vocabularies  as  usually  found  in  thesauri. 

To  accomplish  the  first  step,  the  knowledge  of  an  expert  in  the  field  covered  by  the 
documents  is  required.  The  second  step  requires  an  expert  in  the  special  field  of 
documentation.  Finally,  both  functions  have  to  be  integrated  by  one  person  -  the 
Indexer . 

Thesauri  indexing  results  in  the  intellectual  assignment  of  a  set  of  descriptors  to  each 
document.  In  most  cases,  no  relationships  are  indicated  between  such  thesaurus 
descriptors.  The  accomplishment  of  Indexing  good  enough  so  that  subsequent  retrieval 
will  achieve  acceptably  relevant  search  results  can  only  be  regarded  as  an  art. 

Over  the  last  decade,  three  factors  in  document  analysis  have  become  increasingly 
important : 
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1.  the  immense  increase  of  the  production  of  scientific  and  technical  literature; 

2.  the  rapidly  growing  costs  per  document  for  indexing  and  classification  work;  and 

3.  the  lack  of  qualified  specialists  capable  of  meeting  the  requirements  stated  above. 

The  existing  bottleneck  in  documentation  work  involving  content  analysis  is  a  direct 
consequence  of  the  above  factors. 

During  this  same  time  period,  printing  technology  was  developed  which  uses  machine- 
readable  data  for  computer  editing  and  subsequent  photo-typesetting.  As  a  side-product, 
machine-readable  databases  on  magnetic  tapes  have  become  available  consisting  of 
abstract  journals,  current  awareness  services,  and  similar  printed  products.  However, 
the  material  within  these  databases  has  either  not  been  subjected  to  content  analysis 
or  has  only  been  analyzed  to  the  limited  extent  required  for  producing  printed  indexes. 
The  intellectual  analysis  of  such  databases  for  information  retrieval  simply  is  not 
economically  or  humanly  feasible. 

The  Problems  of  Automated  Indexing 

During  the  sixties,  the  lack  of  collections  of  machine-readable  documents  large  enough  to 
prevent  misleading  results  was  one  of  the  main  problems  in  conducting  and  evaluating 
experiments  using  automated  indexing  methods.  Today  this  problem  has  been  resolved. 

It  now  has  become  evident  that  quite  a  different  quantitative  problem  with  regard  to 
automated  indexing  perhaps  had  been  somewhat  underestimated.  Analogous  to  the  situation 
with  intellectual  indexing,  automated  indexing  requires  an  extensive  set  of  instructions 
for  accomplishing  the  conceptual  analysis  and  for  translating  the  results  into  the 
documentation  language  used.  These  instructions  consist  of: 

-  a  list  or  "dictionary"  of  the  complete  free-text  vocabulary  of  the  subject  field 
to  which  automated  indexing  will  be  applied;  and 

-  a  list  of  the  semantic  relationships  between  the  free-text  terms  and  those  of 

the  controlled  vocabulary  established  within  a  consistent  and  sufficiently  complete 
computer  programmable  model. 

In  addition,  algorithms  for  the  comparison  between  such  lists  and  the  document  texts  must 
be  available.  Other  analytic  features,  e.g.  for  morphological  and  syntactical  analyses, 
and  procedures  for  the  evaluation  and  assignment  of  indexing  terms  to  documents  must  be 
provided. 

Even  this  extremely  simplified  description  of  the  requirements  for  automated  indexing 
should  indicate  that  the  current  bottleneck  involves  the  strategies  and  methods  for 
obtaining  the  needed  instructions,  and  the  integration  of  these  into  the  dictionaries  of 
the  indexing  system. 

The  Problems  of  Free-text  Retrieval 


By  the  early  seventies  it  became  evident,  due  to  reasons  in  part  just  discussed  above, 
that  automated  indexing  would  not  replace  intellectual  indexing  in  the  short-term  future. 

At  the  same  time,  intellectual  indexing  proved  too  costly  and,  given  the  quantity  of 
machine-readable  documents,  impractical  except  for  certain  special  cases.  The  existing 
free-text  retrieval  systems  became  available  as  "emergency  exits”  from  this  nearly 
hopeless  situation.  However,  one  must  now  emphasixe  that  free-text  retrieval  essentially 
transforms  all  of  the  problems  of  content  analysis  into  problems  associated  with  search 
formulation  and  the  retrieval  process. 

Free-text  retrieval  basically  means  that  most  or  all  parts  of  the  stored  documents,  e.g. 
titles,  abstracts,  are  searchable.  Further,  so-called  context  operators  are  available, 
functioning  as  surrogates  for  syntactical  analysis,  in  addition  to  the  well-known  Boolean 
operators  AND,  OR,  and  NOT.  Note  that  free-text  systems  offer  the  searcher  no  assistance 
in  resolving  semantical  problems.  As  such,  good  retrieval  -  obtaining  useful  and  relevant 
results  -  is  dependent  upon  the  experience,  skill,  intuition,  and,  of  course,  the  good 
will  of  the  searcher  far  more  so  than  with  descriptor  retrieval. 

Comparison  of  Both  Methods 

As  a  framework  for  some  suggestions  concerning  future  developments,  a  simplified  comparison 
between  the  indexing  and  free-text  retrieval  concepts  is  offered. 


INDEXING  FREE-TEXT  RETRIEVAL 


controlled  term  free-text  equivalents  of  the  initial  term 
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Indexing  is  an  a-priori  method.  The  indexer  has  the  opportunity  of  viewing  the  topics 
or  subjects  covered  by  the  document  in  context.  The  full  textual  material  of  the 
documents  is  available  to  aid  in  selecting  appropriate  indexing  terms.  On  the  other 
hand,  at  the  time  of  indexing  the  full  range  of  potential  aspects  with  which  documents 
may  later  be  retrieved  from  the  perspectives  of  the  variety  of  potential  requestors  of 
information  typically  is  unknown  or  poorly  stipulated. 

In  contrast,  free-text  searching  is  an  in-situ  method.  For  retrieval,  the  aspects  for 
which  the  search  is  being  conducted  are  known  as  formulated  by  the  requestor  of  the 
information.  As  such,  retrieval  is  independent  of  the  subjective  contributions  of 
indexers  assigning  descriptors  to  documents  which  can  at  best  only  partially  describe 
the  full  informational  contents.  With  free-text  search,  it  is  the  searcher  left 
essentially  on  his  own  in  selecting,  evaluating,  and  determining  the  usage  of  terms 
with  which  a  concept,  topic  or  subject  may  be  expressed  or  denoted  in  the  textual  material. 
In  additon,  the  searcher  has  to  deal  with  problems  stemming  from  the  varieties  of 
syntactical  structures  found  in  textual  material  with  only  the  primitive  tools  of 
context  operators . 

Suggestions  Concerning  Further  Development  of  Automated  Indexing 

Earlier,  it  was  observed  that  the  fundamental  problem  for  automated  indexing  today  Involves 
the  development  of  methods  for  constructing  dictionaries  of  words  with  their  relationships 
established  to  the  vocabulary  of  a  given  subject  field.  The  task  is  to  accomplish  the 
above  in  a  reasonable  time  with  the  assurance  the  the  result  is  sufficiently  complete, 
consistent,  and  computer  programmable.  As  a  first  step,  perfectionism  should  be  avoided 
as  well  as  overreliance  upon  a  subjectively  favoured  technique.  For  example,  both 
linguistic  and  statistical  methods  should  be  considered.  The  results  Of  automated 
indexing  experiments  should  no  longer  be  evaluated  by  a  simple  comparison  with  those  of 
intellectual  indexing.  It  seems  neither  helpful  nor  realistic  to  restrict  automated 
indexing  experimental  work  to  a  very  small  scope  or  subject  field.  Such  efforts  to 
achieve  minimal  improvements  cannot  be  extrapolated  nor  seem  worth  the  candle. 


Independently  of  the  way  in  which  indexing  terms  are  assigned  to  documents,  descriptor 
retrieval  should  be  regarded  as  one  approach  attempting  to  retrieve  documents  as  well  and 
completely  as  possible.  Free-text  retrieval  should  be  regarded  as  yet  another  approach; 
a  complementary  rather  than  a  competitive  one. 


Suggestions  Concerning  Further  Development  of  Free-text  Retrieval 


Modern  free-text  retrieval  systems  sometimes  produce  more  problems  than  they  offer 
assistance  in  satisfying  informational  needs.  Under  these  circumstances,  the  major 
requirement  would  be  to  reduce  the  demand  for  searchers  to  associate  terms  for  query 
formulation  on  his  own  by  procedures  offering  terms  for  selection  either  intellectually 
or  automatically  by  means  of  proposals  made  by  the  computer.  Different  methods  are 
described  in  the  literature  for  the  production  of  such  search  aids  using  mainly  statistical 
means  (e.g.  (4)).  Perfectionism  should  not  be  required  in  producing  such  aids.  It  is  a 
simple  task  for  the  searcher  to  select  appropriate  terms  and  to  ignore  inappropriate 
ones.  Retrieval  systems  should  be  adapted  to  integrate  such  search  aids  and  to  make 
their  application  as  simple  as  possible. 


Extenslonal  table  for  the  term  MASKING 


Statistics:  13  docs,  containing  MASKING 
428  correlated  terms 


Probability  of  "co-occurence": 


61.5% 

PERCEPTION 

61.5% 

VISUAL 

46.2% 

BACKWARD 

46.2% 

MASK 

38.5% 

CONDITIONS 

38.5% 

SUGGEST 

38.5% 

TARGET 

38.5% 

TARGETS 

38.5% 

THRESHOLDS 

30.8% 

OLDS 

30.8% 

PROCESSING 

23.1% 

FREQUENCY 

30.8% 

RECOGNITION 

23.1% 

HEARING 

30.8% 

STIMULUS 

23.1% 

PITCH 

23.1% 

AGE 

23.1% 

SCHIZOPHRENIA 

23.1% 

AUDITORY 

23.1% 

COMPARED 

23.1% 

DISCRIMINATION 

EXAMPLE  TAKES  FROM  (4)  , 

23.1% 

DURATION 

PSYCHOLOGICAL  ABSTRACTS 

23.1% 

23.1% 

EMPLOYED 

FRENCH 

Proposal  for  a  Synthesis  of  Both  Methods 


Automated  indexing  may  be  described  as  a  method  of  stepwise  transformation  from  a  single 
free-text  word  or  set  of  them  to  a  term  of  the  documentation  language.  In  contrast, 
free-text  retrieval  may  be  interpreted  as  a  stepwise  broadening  of  a  basic  retrieval 
query  by  the  addition  of  adequate  terms  using  the  logical  operators  in  order  to  match 
the  individual  formulations  of  a  concept  in  the  documents. 


From  these  two  statements,  a  simple  conclusion  may  be  drawn:  it  should  b  possible  to 
apply  the  same  dictionary  of  terms  and  their  relationships,  the  prerequisite  for  automated 
indexing,  towards  assisting  the  searcher  in  free-text  retrieval.  At  the  same  time,  lists 
of  terms  and  their  relationships  developed  for  automated  indexing  could  be  updated  and 
improved  by  subsequent  evaluations  of  intellectually  produced  retrieval  queries. 
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The  above  and  often-mentioned  dictionary  considered  as  a  resource  for  both  methods  seems 
to  me  to  be  an  overriding  argument  that  automated  indexing  and  free-text  retrieval  should 
be  regarded  as  complementary  approaches  to  solve  the  central  problems  in  information 
retrieval.  Further  developments  in  both  areas  would  benefit  by  emphasizing  their 
complementary  rather  than  competitive  aspects. 
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PACT  RETRIEVAL  IN  THE  1980s 


Viktor  E.  Hampel 
Technology  Information  System 
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Lawrence  Livermore  National  Laboratory 
P.O.  Box  808,  Livermore,  California  94SS0,  USA 


SUMMARY 

This  report  reviews  prevailing  methodologies  of  fact  retrieval  in  science  and  technology  and  makes  surprise-free 
projections  for  the  decade  to  come:  Numeric  databases  are  shown  to  overtake  in  size  and  number  the  large 
bibliographic  collections.  This  is  expected' to  lead  toward  more  sophisticated,  interactive  data  analysis  techniques 
with  graphical  display  options.  The  availability  of  low-cost  intelligent  computer  terminals,  micro-  and  minicomputers, 
is  shown  to  make  aggregation  and  post-processing  of  retrieved  information  from  different  sources  readily  possible. 
This  capability  may  come  into  conflict  with  legal  constraints  and  is  bound  to  affect  the  traditional  marketing  of 
information.  It  will  lead  to  the  extraction  of  higher  forms  of  intelligence  from  text  and  data.  The  user  community  is 
seen  to  shift  from  expert  information  specialists,  who  act  now  as  middlemen,  to  the  end-users  of  information.  This 
less  experienced  user  community  will  challenge  the  ingenuity  of  system  designers  for  self-guiding,  adaptive,  and  yet 
more  sophisticated  man-machine  interfaces.  The  merging  of  wide-band  digital  communication  networks  with 
computer  technologies  will  make  it  possible  to  interconnect  computers,  information  centers,  word  processors,  and 
other  peripherals,  worldwide.  Techniques  of  tabular  and  graphical  fact  retrieval  are  examined.  The  prospects  of  fact 
retrieval  by  voice,  touch  screens,  and  videotext  are  discussed.  The  potential  of  two  unusual  three-dimensional  display 
techniques,  the  computer-generated  time-resolved  integral  hologram  and  the  projection  of  virtual  data  images  into 
space,  are  discussed.  We  conclude  by  examining  the  resulting  problems  and  some  solutions  by  example  of  our 
experience  with  the  integrated  Technology  Information  System  at  the  Lawrence  Livermore  National  Laboratory. 

"Pact  retrieval  is  the  identification  and  use  of  information 
about  events  and  measurements  by  techniques  that 
increase  our  knowledge,  understanding,  and  ability  to 
simulate  and  predict  social  and  natural  phenomena." 


1.  INTRODUCTION 

The  1980s  may  well  be  called  the  decade  of  information.  Quick  access  to  factual  information,  the  ultimate 
product  of  the  post-industrial  society,  has  been  given  a  boost  in  the  1970s  by  the  merging  of  computer  technology  with 
communications.1  This  made  it  possible  for  us  to  generate,  validate,  and  disseminate  information  faster  and 
cheaper,  and  has  brought  it  within  the  reach  of  any  telephone. 


Traditionally,  reports  and  books  are  used  to  convey  the  significance  of  an  event  or  measurement.  The  embedded 
facts  are  then  extracted  and  compiled  in  topical  lists  which  serve  as  concentrated  knowledge  databases  for 
comparison,  evaluation,  dissemination,  and  as  a  starting  point  for  future  work.  These  compilations  of  facts  are 
structured  to  retain  the  essential  attributes  of  their  origin  and  qualify  each  individual  fact  for  later  use  without 
descriptive  text.  It  should  not  surprise  us  to  observe  that  the  historic  use  of  computers  for  storage  and  retrieval  of 
general  information  in  science  and  technology  has  followed  the  same  course. 

Textual  and  bibliographic  databases  were  created  first.  This  marked  the  beginning  of  widespread,  automated 
text  retrieval  services  by  machine,  although  digital  computers  are  Innately  better  suited  to  process  numbers.  The 
large  bibliographic  collections,  now  fully  up-to-date  in  most  fields  of  interest,  represent  a  comprehensive  online  index 
to  the  books  and  reports  of  our  civilization.  By  the  end  of  the  1970s,  1000  discipline-oriented  files  provided  quick 
reference  to  150  million  citations  of  the  abstracted  literature.  Of  these,  some  450  databases  were  identified  as 
being  available  online.  They  are  the  "sine  qua  non"  foundation  on  which  we  can  build  the  evaluated,  numeric  data 
files.  Indeed,  the  relative  ease  by  which  text  could  be  edited  interactively  and  printed  in  different  fonts,  gave  rise  to 
numerous  computer-aided  typesetting  techniques,  a  great  Improvement  over  previous  manual  and  mechanical  means  to 
set  type,  although  printing  and  distribution  of  the  literature  continued  by  traditional  means. 

The  creation  of  numeric  data  files  of  general  interest  followed.  One  of  their  categories  is  the  huge  collection  of 
files  created  by  automated  sensors  to  describe  the  time-  and  space-dependent  phenomena  of  demographic  and 
environmental  observations.  Evaluated  data  of  material  properties,  which  are  costly  a.id  complex  to  generate  and  to 
retain  by  computer  in  traditionally  accepted  forms,  have  made  their  appearance  for  use  by  a  broader  user  community 
only  in  recent  years.  The  total  number  of  fact  files  has  been  estimated  to  exceed  10,000. 


Work  performed  under  the  auspices  of  the  U.S.  Department  of  Energy  by  the  Lawrence  Livermore  National 
Laboratory  under  contrast  number  W-740S-ENQ-48. 
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INFORMATION  CONTENT  OF  DATABASES: 


Ymt 


NUMBER  OF  AVAILABLE  DISTINCT  DATABASES: _ 9 


Y«ar 


This  trend  toward  numeric  data  files  is  expected  to  receive  competition  from  electronic  word  processors.  The 
rapid  generation  of  written  human  communications  by  machine,  and  their  sharing  over  wide-band,  digital 
communications  networks  with  other  machines  and  powerful  computers,  are  bound  to  create  a  natural  mix  of  factual 
information,  both  textual  and  numeric.  To  work  productively  with  these  facts  in  large  volumes  will  require  the 
extraction  of  higher  intelligence  in  concentrated  forms.  The  problem  is  magnified  by  the  fragmentation  of  the  original 
information,  dissimilar  formats,  and  the  necessity  to  presen*  results  in  concert.  This  will  challenge  our  ingenuity  for 
innovative  techniques  of  fact  retrieval  in  the  1980s:  How  can  we  best  establish  practical  procedures  for  the  creation, 
storage,  identification,  validation,  and  display  of  the  diversity  and  masses  of  factual  data  convincingly  and  with 
relative  ease? 

The  immense  power  derivable  from  rapid  access  to  accurate  and  ip-to-date  economic  and  technical  information 
has  been  recognized  by  corporations  and  nations.  Information  management  by  machine  jn  multinational  business 
enterprises  is  reported  to  have  increased  corporate  profits  by  as  much  as  30%  in  1-2  years.2  It  is  our  hope  that  the 
apprehensions  expressed  at  the  recent  7th  international  CODATA  conference  in  Kyoto,  Japan,  alluding  to  the  potential 
prospects  of  data  piracy  and  information  monopolies  in  science  and  technology,3’4  may  give  way  to  a  mutual  and 
controlled  sharing  of  this  emerging  powerful  resource.  This  will  help  developing  countries  to  catch  up,  increase  the 
productivity  of  industrially  developed  nations,  and  benefit  mankind,  in  general,  rather  than  a  privileged  few. 

In  this  paper,  I  would  like  to  review  the  status  of  fact  retrieval  in  science  and  technology  as  1  have  had  the 
privilege  to  observe  and  practice  it  at  a  large  national  laboratory  where  advanced  computers  and  communications  are 
commonplace.5  In  the  parlance  of  Herman  Kahn,  this  should  make  it  possible  to  make  surprise-free  projections  for 
the  years  to  come. 

2.  THE  STATUS  OF  FACT  RETRIEVAL  TODAY 

There  is  a  saying  that  we  must  learn  from  the  past,  lest  we  be  damned  to  repeat  its  mistakes.  Our  minds  filter 
bits  of  information  continuously.  We  compare  them  with  past  experience,  establish  their  worth,  and  remember  the 
essential  facts  for  future  use  in  an  unending  chain  of  analysis  and  synthesis.  For  these  bits  of  information  to  be 
valuable,  they  must  be  factual,  up-to-date,  accurate,  accessible,  concise,  usable,  and  controlled. 

"Fact  and  Value" 


Valuable  facts  are  more  than  numbers  alone.  They  can  be  descriptions  of  events,  or  well  articulated  and 
accredited  postulates.  For  example,  the  letter  written  by  Albert  Einstein  to  President  F.  D.  Roosevelt6  expressing 
apprehension  that  scientists  in  Germany  may  have  split  the  atom,  and  that  it  should  be  possible  to  build  a  bomb 
because  more  neutrons  were  thought  to  be  liberated  in  the  fission  process  than  were  lost  or  absorbed,  and  that  the  U.S. 
could  preempt  them.  This  letter  did  not  contain  a  number.  But  it  was  so  valuable  at  the  time  that  it  started  the 
enormous  Manhattan  Project,  led  to  the  early  end  of  World  War  U,  and  ushered  in  the  nuclear  age.  I  am  pointing  this 
out  to  emphasize  that  fact  retrieval  need  not  be  limited  to  numerical  data  to  be  of  value.  Indeed,  the  extraction  of 
higher  intelligence  from  descriptive  text  may  well  be  of  greater  worth.  It  is  capable  of  projecting  our  dreams  and 
plans  farther  into  the  future  than  the  knowledge  of  a  more  accurate  event  or  measurement  alone. 

The  extraction  of  new  insights  from  large  volumes  of  information,  be  they  text  or  data,  is  clearly  the  challenge 
of  the  future.  The  real  payoff  in  fact  retrieval  from  the  literature  and  numeric  data  files  is,  therefore,  not  only  the 
speed  by  which  this  retrieval  can  be  accomplished,  but  the  new  insight  and  understanding  to  be  gained. 


Shrinking  time  gap  between  initial 
discovery  and  final  development. 


i  a 
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"PatednesB  and  Accuracy  of  Facts" 

Our  judgement  is  only  as  good  as  our  knowledge!  And  yet,  our  information  systems  today  are  still  hampered  by 
long  delays  before  factual  information  enters  the  main  stream  of  computer-aided  retrieval  and  evaluation.  Estimates 
made  in  a  report  by  the  American  Physical  Society  range  from  2  years,  for  announcements  of  ongoing  RAD  in 
Bulletins,  to  7  years,  for  compilations  of  evaluated  numerical  data  derived  from  measurements.  These  estimates  were 
made  in  1970  but  delays  are  probably  not  much  shorter  today.  In  the  United  States,  little  incentive  has  been  given  by 
government  or  industry  to  correct  this  situation.  Other  countries,  more  dependent  on  the  flow  of  know-how  and 
factual  data  from  abroad  than  the  United  States,  where  57%  of  all  publication  iiL the  energy  field  originated  in  1980, 
persisted  in  putting  into  action  deliberate  plans  for  national  information  systems.”  i8'9'10 
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This  inadvertent  delay  before  results  of  ongoing  research  can  be  retrieved  by  computer  is  further  exaggerated  in 
the  United  States  by  the  inadequate  funding  of  Information  Analysis  Centers  which  compile  and  evaluate  the  measured 
data.  Recognized  data  evaluation  centers,  such  as  those  at  the  NBS  Office  of  Standard  Reference  Data,  and  others 
operatic  as  members  of  the  Standard  Reference  Data-  System,  have  been  working  for  years  with  limited  and 
inadequate  budgets.  The  Numerical  Data  Advisory  Board  of  the  National  Academy  of  Sciences,  Committee  on  Data 
Needs,  published  in  1978  a  report  on  National  Data  Needs  for  Critically  Evaluated  Physical  and  Chemical  Data 
(CODAN)7*‘  It  recognized  an  urgent  need  for  the  doubling  of  support  for  data  evaluation  over  a  6-year  span  of  time 
just  to  catch  ig>  tdth  the  backlo^tihxisting  measured  data.  The  overall  national  funding  for  data  evaluation  in  1977 
was  only  16.8  minion,  of  which  the  government  provided  90%.  Little  action  of  any  consequence  has  been  taken  to 
date.  Other  countries,  however,  are  taking  the  initiative  to  capture  the  transborder  information  flow  and  are 
establishing  in  new  and  old  areas  of  RAD  their  own  information  analysis  centers.  We  are  faced,  therefore,  with  a 
situat^MfWhere  data  are  being  measured  with  automated,  computer-based  equipment  in  larger  quantities  than  every 
befoMnxit  where  the  users  of  these  data  in  the  United  States  are  obliged  to  fish  them  out  from  the  general  literature 
argIKake  their  own  value  judgements  of  good  or  bad,  to  remeasure  them,  or  to  buy  data  from  abroad  where  available.* 

"The  Cost  of  Facts” 

It  is  difficult  to  attach  a  price  tag  for  facts  derived  from  news  or  events.  Their  value  is  determined  by  timing 
and  circumstance,  as  was  pointed  out  earlier  with  reference  to  Einstein's  historic  letter. 

Primary  publications  are  priced  by  page  regardless  of  their  value: 
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Online  bibliographic  references  that  point  to  the  fact  that  potentially  relevant  publications  took  place  are  sold 
with  even  less  discrimination  at  constant  unit  price,  regardless  of  value  or  length  of  their  primary  publication: 

Offline  printing  $0.05  —  $5.00  per  citation 

Online  printing  or  viewing  $0.50  —  $50.00  per  citation 


Online  numeric  data  are  not  sold  on  a  unit  basis  today.  As  a  rule,  they  are  part  of  a  larger  file  and  coats  are 
based  on  algorithms  of  Computer  Resource  Units,  storage  costs,  and  connect  times,  as  will  be  shown  later  on.  As  such, 
they  are  much  cheaper  today  than  the  primary  publication  wherein  they  are  embedded,  or  the  online  citations  that 
point  to  them,  oven  though  these  facts  are  the  essential  and  costly  part  of  any  document. 


Col.  A.  Aines,  longtime  advocate  of  data  management  in  science  and  technology  by  computer  has  extended  a 
challe^e,  to  any  one  who  could,  to  compile  a  report  of  horror  stories  where  available  data  were  not  found  in  time,  or 
were  remeasured  at  great  expense;  or  even  better,  where  the  wrong  data  were  used  and  caused  the  failure  or  delay  of 
costly  projects.  So  far,  no  one  turn  volunteered.  Those  involved  are  too  embarrassed  to  speak  up.  Perhaps  we  should 
ask  the  retired  members  of  our  professional  societies  to  speak  to  us  with  courage  of  their  mistakes  and  lemons  learned. 
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Therein  lies  somewhat  of  a  paradox:  Demographic  and  time-series  data  of  our  environment  are  massive,  and 
their  value  may  become  quickly  obsolescent.  But,  when  we  look  at  the  cost  of  accurately  measuring  material 
properties  in  science  and  technology,  we  arrive  at  a  very  high  unit  coat.  In  the  previously  mentioned  CODAN  report,  a 
literary  publication  in  the  sciences  was  estimated  to  cost  $45,000  on  the  average  in  1976  dollars  and  required  $4,000 
for  printing.  This  was  estimated  from  the  time  and  effort  required  to  do  the  measurements,  assuming  a  year's  worth 
of  research  and  a  median  annual  salary  of  about  $22,500  with  100  percent  overhead.  The  absolute  worth  of  data  in  any 
publication  is  difficult  to  judge  in  the  absence  of  compilations  and  comparison  with  other  data  by  expert  evaluators. 
To  establish  their  worth  obliges  the  potential  user  to  familiarize  himself  with  the  subject  matter,  to  review  the 
literature,  and  to  arrive  at  an  unbiased  value  judgement.  This  points  to  the  urgency  and  overall  coat-effectiveneas  of 
authenticated  data  evaluations. 

The  cost  for  compiling  and  evaluating  data  is  comparatively  small.  Based  on  16  years  of  operating  experience 
with  the  Joint  Army  Navy  Air  Force  (JANAF)  Thermochemical  Tables,  the  unit  cost  of  data,  representing  one  material 
property  as  a  function  of  temperature  and/or  pressure,  is  only  $1,000.  Since  ten  publications  are  found  to  support 
one  data  sheet  in  most  cases,  the  investment  of  $1,000  can  capture  the  essence  of  $500,000  in  R&D  expenditures  for 
others  to  use  with  confidence. 


The  scarcity  of  up-to-date  evaluated  data  compilations  of  physical  and  chemical  properties  probably  accounts  for 
their  low  market  value.  It  leads  inadvertantly  also  to  an  unaccounted  transborder  flow  of  costly  primary  data.  Other 
countries  are  making  it  their  business  to  harvest  these  factual  resources  and  to  market  them  with  advantage! 


The  Storage  of  Facts" 

Storage  requirements  of  facts  by  electronic  means  are  assuming  staggering  proportions.  We  are  committing 
more  written  communications,  observations,  and  calculated  data  to  computer-aided  storage  and  retrieval  than  ever 
before.  Estimates  for  some  of  the  large  data  producers  and  users  are:'* 


Lawrence  Livermore  National  Laboratory 

Bank  of  America 

Shell  Oil  Development 

Social  Security  Administration 

Exxon  Geophysics  Logging 


50,000  Tapes 
400,000 
600,000  " 

750,000  " 

1,000,000  " 

2,800,000  " 


This  represents  (at  a  storage  density  of  6250  bits  per  inch  for  a  0.5-in.-wide  magnetic  tape  of  2400  feet,  inclusive  of  a 
30%  waste  due  to  record  gaps),  a  conservative  capacity  per  tape  of  1  x  10®  bits,  or  an  overall  requirement  for  the 
above  sample  of  organizations  of  about  101®  bits.  Only  the  IBM  Photo-Digital  storage  systems  with  10“  bits 
capacity  each,  came  close  to  filling  such  a  demand;  but,  all  of  these  mechanical,  wet-chemical  systems  are  now 
retired.  Today,  as  we  are  entering  the  1980s,  the  projected  state-of-the-art  for  storage  technology  is  estimated  to  be: 

Magnetic  10®  bits/cm  2 

Optical  10  bits/cm® 

Electronic  10**  bits/cm2 


Of  these,  only  the  magnetic  tape  storage  media  represented  by  Automatic  Tape  Libraries  (ATL)  are  marketed.  To 
appraise  the  situation  at  the  DOE  National  Laboratories,  and  to  make  demand  forecasts  for  future  storage 
requirements,  a  survey  was  made  in  1979  with  the  following  results:13 


Survey  Results  on  Storage  Capacity  at  Nine  DOE  Sites  for  1979. 

Max1«u»  Maxiimja  Mass 

Tepe  disk  Mass  storage 


SUe 

Tape 

drives 

Tapes 

capacity 

(10l?  b'M) 

OiSk 

drives 

Disk 

packs 

capacity 
(1010  bits) 

archive 

storage 

used 

(101?  bits) 

Argonne 

24 

17,120 

8.07 

128 

396 

24.0 

none 

- 

Brookhaven 

22 

>00 

1.72 

28 

56 

7.17 

none 

Idaho  Falls 

CT0€R  Systaa 

8 

18.000 

4.68 

12 

20 

2.76 

nag  tape 

8  week 

IBM  360/7 5  Syste*  9 

23.600 

4.60 

26 

40 

3.2 

backup  Only 

Femi  Lab 

24 

30,000 

5.85 

20 

20 

3.2 

ATL 

LASL 

44 

50,000 

6.5 

70 

75 

6.0 

IBM  1360 

2.2 

(fixed  head) 

S3 

12.7 

IBM  3850 

2.3 

LBL 

22 

41,500 

7.07 

24 

24 

2.82 

IBM  1360 

3.2 

♦  30,000 

IBM  3850 

archived 

LLL 

50 

45,000 

4.41 

no 

175 

152.0 

IBM  1360 

3.3 

ATL  ♦ 

C0C  38500 

1.0 

Sandia/AlS 

45 

26,000 

1.74 

77 

150 

1.51 

none 

Oak  Ridge 

39 

36,500 

6.54 

134 

430 

39.0 

none 

Total 

286 

325.720 

51.2 

629 

1009 

754.4 

A  data  sheet  may  be  a  single  reaction  rate  constant  as  a  function  of  temperature.  By  1978,  2239  data  sheets  had 
been  produced  In  16  years. 
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Survey  Results  on  Projected  Storage  Capacity  at  Nine  DOE  Sites  for  1984. 

Maximum 
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disk 
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drive* 
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8.06 
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20 
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5 
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- 
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80 

46 
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14 
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2 
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17 
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10.1 

16 
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32 
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11.4 
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1 
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2 
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0.15 

4 

18 

tap* 

cartridge 

04 

Total 

236 

246,120 
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208.3 

- 

- 

We  observe  an  overall  increase  of  anticipated  archival  storage  by  a  factor  of  tot  for  1984,  and  a  simultaneous 
decrease  of  installed  equipment.  One  of  the  interpretations  offered  is  an  expectation  that  optical  devices  should 
indeed  be  capable  of  providing  10  times  the  storage  density  now  possible  on  magnetic  tape.  The  results  of  this  survey 
were  used  to  set  specifications  for  storage  devices  in  the  mid  1980s.  However,  to  date  we  have  received  no  indication 
from  serious  respondents.  An  extract  from  our  specifications  follows. 

EXPECTATIONS  FOR  ARCHIVAL  MEMORY  PERFORMANCE14 


Property 

1980 

1982 

1984 

System  capability  (bits) 

£2x  1011 

BIO13 

BIO15 

Read-write  stations  (No.) 

—  2 

i4 

B10 

Read-only  stations  (No.) 

-2 

a  10 

B30 

Accesses  per  hour 

— 2000 

'•8000 

~ 30000 

Transfer  rate  (bits/s) 

~1°? 

~4x  107 

~10  n 

System  bit  error  rate 

'10-  ‘ 

*10"" 

Undetected  bit  error  rate 

^ 10~12 

dO-14 

« 10-16 

The  rates  at  which  data  are  being  added  today  are  equally  staggering:  At  LLNL,  anyone  of  the  four  CRAY  computers 
can  generate  10^  bits  of  data  per  hour  for  a  typical  2-D  hydrodynamic  calculation.  As  a  rule,  only  1/10  of  that,  or 
10®  bits,  need  probably  be  saved.  A  yet  larger  data  source  is  the  Landsat  satellites  where  one  pass  would  produce 
10 16  bits  of  data,  were  it  not  for  selective  data  suppression  and  compression.  But  the  techniques  of  file  retrieval 
from  these  large  storage  media  are  not  our  concern  in  this  paper.  They  are  specialized  and  are  not  likely  to  affect  the 
general  user  requirements  of  fact  retrieval.  Most  of  our  needs  in  the  1980s  will  probably  be  well  satisfied  by 
extraction  of  facts  from  prestaged  archival  data  files,  and  from  interactions  with  less  powerful,  but  more  numerous 
minicomputers  that  have  entered  our  working  environment. 

"The  Analysis  of  Facts" 

Programmatic  requirements  of  the  Department  of  Energy  have  contributed  significantly  to  the  design  and 
development  of  high-speed  computers.  Each  new  generation  of  the  fastest  machines  has  been  used  for  data  analysis 
and  to  simulate  intricate  models  of  nuclear  weapons,  laser  fusion,  magnetic  fusion,  reactor  safety, 
biological/environmental  phenomena,  and  socioeconomic  energy  predictions.  JSut  the  rate  of  performance 
improvement  is  slowly  down.  It  took  all  of  the  1970s  to  gain  another  factor  of  ten.15 
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Recent  advances  in  Josephson  junctions  and  the  Gaits  technology  suggest  yet  another  quantum  jump  to  IQ9 
operations  per  second  in  the  1980s.  The  significance  of  this  expectation  is  that  another  generation  of  even  more 
powerful  micro-  and  minicomputers  may  also  become  available  for  general  use.  This  would  accelerate  the  dramatic 
changes  of  the  traditional  marketing  and  utilization  of  bibliographic/numeric  data  that  we  are  seeing  today.  The 
post-processing  of  retrieved  information  toward  higher  forms  of  intelligence  is  clearly  the  challenge  of  the  1980s. 

"Directories  to  Databases  of  Facts1' 


To  access  and  analyze  information,  one  must  first  know  where  to  find  it!  The  publishing  business  traditionally 
prints  periodic  updates  of  reference  books  on  topical  issues.  In  recent  years,  these  publications  were  augmented  by 
printed  directories  to  computer-based  resources.  The  current  "Directory  of  On-Line  Databases,"  by  Cuadra  Associates 
lists  770  data  files  and  135  on-line  services.19  The  "Directory  to  Computer  Readable  Databases,"  updated  each  year 
by  Martha  Williams  under  auspices  of  the  American  Society  of  Information  Science,  shows  in  its  1979  edition  528 
distinct  data  files.1'  The  corresponding  international  directory  to  computer-based  systems  with  European  emphasis, 
EUSIDIC,  published  by  Learned  Information,*  Inc.,  lists  more  that  1280  databases.1"  The  somewhat  dated  report  by 
Westbrook  on  data  sources  for  materials,  cites  in  1978  about  300  data  files  in  science  and  engineering.19 
Unfortunately,  these  directories  are  not  yet  available  online  even  though  computers  were  used  to  compile  them  and  to 
print  them.  This  is  bound  to  change  in  the  near  future  as  government  and  business  recognize  the  value  of  these  master 
guides  to  information  stores. 

"The  Tools  of  Fact  Retrieval" 


Data  storage  and  retrieval  was  the  goal  of  the  1970s,  and  different  database  manipulation  software  was  built 
commercially  and  at  universities  to  bring  it  about.  We  distinguish  two  classes: 

1)  Data  Management  Systems  (DMS).  which  permit  access  to  and  retrieval  from  already  existing  files,  usually 
for  single  applications.  Most  bibliographic  online  information  retrieval  systems  are  of  this  type. 

2)  Database  Management  Systems  (DBMS),  which  manage  and  maintain  data  in  a  prescribed  structure  for  the 
purpose  of  being  processed  by  multiple  applications,  independent  of  storage  device  class  or  access  method. 
They  organize  data  elements  in  some  predefined  arrangement  in  a  database  and  retain  relationships 
between  different  data  elements  within  the  database..  These  systems  are  commonly  used  with 
numeric/structured  data  under  the  user's  controL 

For  large  volumes  of  data,  like  those  being  communicated  to  earth  from  observation  satellites,  efficient  and 
specialized  programs  were  developed  and  are  usually  not  suited  for  generalized  applications  by  casual  users.  For 
comparatively  small  and  diversified  collections  of  data,  a  host  of  more  generalized,  less  efficient  data  management 
systems  have  evolved.  Depending  on  the  logical  relationship  among  data  sets  and  data  elements,  the  major  DBMS 
models  now  in  use  are  those  best  capable  of  working  with: 

o  simple,  COBOL-like  'flat  files', 

o  hierarchical  or  tree-like  data, 

o  graphs  and  networks,  (CODASYL), 

o  relational  tables,  and 

o  data  well  represented  by  extended  set  theory  models. 

No  definite  bounds  exist  among  the  systems  that  implement  these  models.  Most  systems  show  considerable 
flexibility.  No  concensus  has  been  reached  to  define  the  best  model  for  general  database  work.  An  overview  of  their 
relative  merits  and  historic  dependence  was  made  by  Fry.20  But,  to  those  shopping  for  potential,  suitable 
candidates  among  the  large  number  of  systems,  their  underlying  theory,  strengths,  and  limitations  may  only  be  of 
partial  interest.  The  questions  more  likely  to  be  asked  concern  user-oriented  features;  the  types  of  computers  and 
operating  systems  on  which  the  systems  can  be  installed;  the  host  languages  to  which  they  interface;  and  cost. 

The  online  bibliographic  retrieval  systems  and  services  have  changed  little  since  their  inception  in  the  late 
1960s.  Only  the  numbers  of  online  topical  data  files  and  citations  have  quadrupled.  Many  still  seem  to  believe  that 
the  greater  the  number  ot  citations  retrieved  in  retrospective  searching,  or  by  Selective  Dissemination  of  Information 
(SDI),  the  better  the  service.  Relevancy,  reoal',  and  precision  were  hotly  debated  issues  and  vary  with  the 
sophistication  of  the  indexing  and  retrieval  method,  and  whether  keywords,  and/or  titles,  and/or  abstracts  are 
scanned.  At  LLNL  we  tried  and  documented  several  experiments  and  settled  on  searching  by  indexed  keywords  and 
words  in  titles.  Abstracts  introduced  too  much  noise.  Most  of  the  bibliographic  systems  are  patterned  after  DIALOG 
(Lockheed)  and  provide  Boolean  operators  for  retrieval  of  whole  words  or  compound  expressions  from  indexed  tables  of 
keywords  and  authors.  Sets  can  be  created  and  combined  to  further  refine  a  desired  result.  Few  bibliographic 
information  management  systems  offer  capabilities  for  interactive  algebraic  work  with  numbers.  F re  e-text  searching 
of  titles  or  abstracts  is  less  frequently  available.  But,  regardless  of  how  these  systems  retrieve  their  citations,  nearly 
all  bibliographic  information  today  is  being  delivered  as  a  pile  of  paper.  At  best,  it  is  printed  or  flashed  on  the  CRT 
screen  in  reverse  chronological  order  of  its  publication  date,  requiring  the  hapless  recipient  to  just  look  at  it! 


TwdWonW  SOI  Ssgjsr. 


"Look,  how  much  t  found  for  you.  ■ 
"You  can't  it on  it  In  your  mini. . . " 
"You  can’t  toproduca  it  by  any  mount. . . 

. .  -hnt  took  at  it. . . " 
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Retrieval  with  embedded  character  strings,  weighting  of  terms,  and  proximity  indicators  have  become  more 
prevalent  with  the  large  legal  text  retrieval  systems,  e.g.  JURIS,  LEXIS,  and  WESTLAW,  among  others. 

A  detailed  comparative  analysis  of  numeric/structured  data  management  by  means  of  the  commercial  systems, 
or  by  university  systems,  is  difficult  because  of  the  continuous  change  exhibited  by  these  systems,  estimated  to 
number  about  65  worldwide.21  Suffice  it  to  say  that  most  of  them  tend  to  be  modular  with  interfaces  to  procedural 
languages  and  to  statistical  and  graphical  programs.  Excellent  seminars  on  "Comparative  Database  Management"  are 
being  offered  each  year  by  the  University  of  California  Extension  Division,  Los  Angeles.  Comparisons  of  21  systems 
and  their  capabilities  were  made  by  Auerbach22  and  Datapro.  3 

One  of  the  most  promising  retrieval  systems  for  numeric/structured  data  developed  during  the  1970s  for  general 
use  is  the  Chemical  Information  System,  (CIS),  sponsored  by  the  National  Institute  of  Health  (NIH)  and  by  the 
Environmental  Protection  Agency  (EPA). 

CIS  is  a  network  of  chemical  databases  equipped  with  computer  programs  that  permit  interactive  searching  and 
retrieval  from  these  databases.  At  the  heart  of  the  network  is  the  Structure  and  Nomenclature  Search  System 
(SANSS)  which  is  used  to  identify  a  chemical  substance,  given  its  name  or  its  structure,  and  refer  the  user  to  all  CIS 
files  that  contain  data  on  the  compound.  Any  such  data  can  then  be  retrieved  with  simple  commands  from  the 
appropriate  node  of  the  CIS  network.  A  different  way  of  using  the  CIS  is  to  enter  experimental  data,  such  as  mass 
spectral  peaks,  into  the  system,  which  will  identify  the  compound  at  hand,  using  its  files  of  ten  of  thousands  of  mass 
spectra,  nmr  spectra,  or  x-ray  powder  diffraction  patterns.  The  entire  CIS  has  been  developed  by  cooperating  agencies 
of  the  U.S.  Government  and  is  available  in  the  private  sector  for  use  by  the  public  on  a  fee-for- service  basis.  It  is 
accessible  worldwide  through  the  TELENET  telecommunications  network.  Today,  the  CIS  system  is  still  a  collage  of 
files  and  programs  originally  provided  by  developers  of  each  database.  However,  gradually,  a  unifying  approach  is 
being  implemented  that  will  make  CIS  the  most  comprehensive  collection  of  physical  and  chemical  data  files. 

The  CODASYL  reports,24*25  published  early  in  the  1970s,  identified  common  and  desirable  features  of 
database  management  systems  and  had  a  very  beneficial  '.cnpact  on  their  evolution.  They  provided  a  basis  for  the 
database  architecture  and  Federal  database  standards.25  Generalized  Database  Management  systems  (GDBMS), 
tailored  to  account  for  attributes  and  peculiarities  of  data  in  science  and  technology,  did  not  materialize  as  yet.  In 
most  cases  they  were  adaptations  of  systems  used  in  business.  Attempts  to  point  out  the  special  requirements  for 
scientific  data  were  made  by  the  specialists'  conference  on  "Generalized  Data  Management  System  and  Scientific 
Information"  in  1978,  sponsored  jointly  by  the  Department  of  Energy  Office  of  Technical  Information  and  the  Agence 
de  l'OECD  Pour  l'Energie  Nuclaire.2”  This  was  followed  by  a  NASA-sponsored  conference  on  Engineering  and 
Scientific  Data  Management28. 

For  those  who  may  wish  to  further  explore  the  evolution  of  fact  retrieval  during  the  1970s,  I  would  like  to  call 


and  early  attempts  at  integrating  text  and  numeric  data  3Z.  Additional  topical  literature  is  found  in  the  Annual 
Reviews  of  Information  Science,33  especially  in  Volume  14  of  the  1979  proceedings  of  the  "Online  Information" 
meetings,34  and  the  international  conferences  on  very  large  databases.35 


"The  Marketing  of  Facts" 

Fact  retrieval  from  commercial  vendors  costs  about  twice  that  from  government  installations  where  recovery  of 
expenditures  became  an  operational  requirement  during  the  1970s.  In  the  last  few  years,  the  pricing  has  remained 
reasonably  stable,  except  for  costs  of  connect-times.  We  quote  from  the  Cuadra  Associates  Spring  1981  Edition  of 
the  online  catalog:18 

"Pricing  policies  for  access  to  and  use  of  the  online  database  services  are  extremely  complex.  There  are  a 
number  of  components  to  the  prices  and  they  are  combined  in  many  different  ways.  In  addition,  prices  are  subject  to 
change  with  fairly  short  notice.  All  of  these  factors  make  it  difficult  to  treat  the  topic  in  a  standard  manner. 
However,  there  are  some  general  points  that  can  be  made.  In  at  least  30  percent  of  online  services,  there  is  an 
indication  that  some  type  of  subscription  is  required  for  gaining  access  to  the  database.  These  subscriptions  range 
from  a  few  hundred  dollars  per  year  to  several  thousands  of  dollars.  In  some  cases,  the  user  subscribes  to  a  package, 
which  may  include  one  or  more  databases  and  additional  services  (e.g.,  consulting).  In  other  cases,  the  user  has  several 
options,  each  a  combination  of  a  subscription  price  and  an  associated  usage  charge. 

In  general,  the  major  components  of  the  usage  prices  for  online  database  services  differ  acco-"ng  to  the  type  of 
supplier  and  the  type  of  database  (e.g.,  whether  it  is  a  bibliographic  database  or  a  numeric  database*.  There  are  two 
major  groups  of  policies:  one  for  the  timesharing  firms  and  their  (largely)  numeric  databases,  and  or.,  for  the  others, 
covering  most  of  the  other  types  of  databases.  There  are,  however,  exceptions  in  each  group,  e.g.,  where  a 
timesharing  firm  offers  service  on  a  referral  database  but  prices  it  more  like  a  bibliographic  database. 

Pricing  by  timesharing  firms  in  business  primarily  for  numeric  databases,  requires  for  most  a  monthly  minimum 
(e.g.,  $100  per  month)  that  is  applied  if  the  total  usage  charges  for  a  given  month  do  not  reach  the  minimum  level. 
The  usage  charges  include  the  following  components: 


Connect  Time  $1.00  - 

Computer  Resource  Units  $10.00- 
Disk  Storage 


$21.00  per  hour 

$1.25  per  unit  (varies  with  definition) 
varies 


These  rates  can  also  vary  within  a  specific  service,  depending  on  the  speed  of  the  terminal  being  used  (e.g.,  300  or 
1200  baud)  and  the  time  of  day  in  which  the  processing  occurs  (e.g.,  prime  time  vs  non-prime  time).  In  addition,  the 
Computer  Resource  Units  charged  for  a  particular  database  may  be  greater  than  the  standard  timesharing  rates 
charged  for  other  data  processing  services.  This  difference  occurs  either  because  the  database  system  that  is  being 
used  is  more  demanding  of  resources,  or  because  a  surcharge  has  been  added  to  the  standard  rates  (by  a  multiplier  or 
additive  factor)  as  a  royalty  to  the  producer  of  the  database.  In  most  other  cases,  pricing  by  online  services  is  based 
on  an  hourly  connect-time  rate,  plus  telcommunications  costs  for  network  access,  if  applicable.  The  hourly 
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connect-time  rates  that  are  cited  in  the  supplier’s  literature  may  include  the  royalty,  or  the  royally  may  he  cited 
separately.  The  range  of  connect-time  rates  (including  applicable  royalties)  is  from  about  $25  to  $500  per  hour.  The 
average  is  approximately  $65  per  hour. 

For  bibliographic  and  some  referral  databases,  there  is  an  additional  charge  for  offline  printing,  which  is  generally 
based  on  the  number  of  citations  or  pages.  The  range  of  charges  for  offline  printing  is  from  $u.(i5  to  $5.00  per 
citation,  although  for  a  few  databases  they  may  be  considerably  higher.  The  average  is  about  $0.15.  mere  may  also 
be  charges  for  online  printing  (i.e.,  displaying  retrieved  information  directly  at  the  terminal),  lhe.se  fees  range  from 
$0.50  to  $50.00  per  item.  Both  online  and  offline  printing  charges  may  vary  depending  on  th  •  amount  of  information 
that  is  printed.  In  some  cases,  the  use  of  a  service  involves  a  startup  fee,  which  often  cover.,  account  setup,  initial 
training,  and  materials.  Occasionally  this  startup  fee  also  includes  the  cost  of  a  special  termini!  and/or  ieased  lines  to 
the  online  service's  computer.  Many  of  the  online  services  that  focus  on  the  provision  of  dat.ib  ise  access  also  provide 
volume  discounts  or  have  subscription  plans  that  provide  for  various  levels  of  connect-time  rules,  depending  upon  the 
expected  level  of  usage." 

The  cost  of  using  the  CIS  is  approximately  $45  per  connect  hour  for  most  components,  $75  per  connect  hour  for 
the  major  files,  notably  SANSS  and  PDSM.  In  addition  to  the  search  costs,  there  is  an  annual  $500  subscription  fee, 
which  is  used  to  defray  all  storage  costs.  Costs  now  levied  by  other  government  information  centers  are  similar. 

Search  times  per  database  require  on  the  average  about  15  minutes  and  are  reported  to  range  from  just  a  few 
minutes  to  more  than  an  hour.  If  we  consider  the  5  million  searches,  or  queries,  conducted  in  1979  against  the  150 
million  records  of  computer-readable  files  in  the  United  States  and  Canada  alone,-*6  one  arrives  at  an  estimate  for 
the  total  commercial  revenue  of  marketing  information  on-line:  $125, 000, 000/year.  Actual  costs  to  the  buyer  who 
provides  the  information  specialists,  terminals,  and  research  facilities,  are  much  greater. 

"Other  Issues  of  Fact  Retrieval" 


There  are  other  issues  which  have  their  origin  in  the  1970s:  Wide-band  communications,  the  electronic  office, 
user-friendly  interfaces,  security,  copyrights,  and  novel  ways  of  fact  analysis  and  display.  Because  of  their 
significance  in  the  years  to  come,  I  treat  them  jointly  by  discussing  their  expected  state  of  technology  in  the  next 
decade. 

3.  PROSPECTS  FOR  THE  FUTURE 

The  rapidly  increasing  high-speed  digital  communications  are  starting  to  link  previously  separate  communication 
channels.  Voice,  data,  and  video  are  beginning  to  serve  as  an  integrated  medium  for  fact  retrieval.  Their  analysis  and 
synthesis  on  powerful  micro-  and  minicomputers  will  bring  about  new  forms  of  communicable  intelligence.  This,  in 
turn,  will  increase  the  value  of  factual  information  and  introduce  controls  for  its  exchange  and  use. 

"The  Communication  of  Facts" 


Faster  and  cheaper  computers  forced  increasing  demands  for  high-speed  transmission  of  digital  data  in  the  past 
decade.  Rates  increased  from  9,600  bps  for  early  telephone  lines  to  12-14  GHz  for  satellites  approved  in  January, 
1981,  by  the  Federal  Communications  Commission  (FCC)  for  Satellite  Business  Systems  (SBS).  The  impact  of  this 
change  in  a  relatively  short  span  of  time  will  even  be  greater  when  the  additional  20  domestic  satellites  authorized  by 
the  FCC  last  December  come  into  operation.  This  will  permit  full  integration  of  voice,  data,  video,  and  image 
transmission.  The  advantages  are  bound  to  affect  every-day  communication  for  fact  retrieval: 

1)  Communication  costs  will  cease  to  be  a  function  of  distance. 

2)  Point-to-multipoint  broadcasting  will  make  simultaneous  updating  of  distributed  databases  practical. 

3)  Organizational,  computer-based  networks  could  be  established  virtually  overnight. 

4)  The  higher  frequencies  will  permit  the  use  of  smaller  earth-station  antennas,  5-7  m  in  diameter. 


At  the  Lawrence  Livermore  National  Laboratory  we  now  use  two  antennas  with  the  WESTAR  satellites  which  link 
the  Magnetic  Fusion  Energy  Computer  Center  (MFECC)3"  network  with  Princeton  University.  However,  for  moat  of 
us,  it  will  take  several  years  before  the  required  number  of  ground  stations  are  up  and  ready  for  use,  end  before  the 
band  width  for  linking  to  these  ground  stations  can  be  increased  by  local  hyperchannel  networks  or  fiber-optics 
communications,  among  others. 


6-9 


Today,  as  this  paper  is  going  to  press,  co6ts  per  month  for  cross-country  communication  between  San  Francisco 
and  Washington,  D.C.,  are  typically  those  shown  below:+ 


RATE  ANALOG  DIGITAL 

With  Modems  Without 


1,200 

bps 

$  2,140 

$  2,048 

Not  Offered 

2,400 

bps 

2,218 

2,048 

$  2,124 

4,800 

bps 

2,430 

2,048 

2,338 

9,600 

bps 

2,756 

2,048 

2,684 

19,200 

bps 

5,942 

2,048 

Not  Offered 

56,000 

bps 

25,49° 

2,048 

11,286 

<(Vendor  sources  unknown,  conditioning  would  be  required. 

No  source  available  for  vendor-supplied  Data  Service  Units  (DSUs)  required  to  operate  on  the  D.D.S.  network. 
+  Private  communications  with  Jack  Hibbard,  LLNL,  Communications  Office,  July  1981. 


The  rates  quoted  are  with  and  without  Bell  modems.  Installation  charges  for  analog  communication  are  $100  without 
modems,  and  range  from  $254  to  $639  with  modems. 

The  digital  computer  networks  ARPANET,  TYMNET,  and  TELENET,  made  their  entry  in  the  1970s.  Of  these; 
ARPANET  is  sponsored  by  the  Department  of  Defense  and  was  the  first  large-scale  experiment  in  computer 
communications  by  linking  various  large  machines  on  many  university  campuses  and  government  installations. 
Although  successful  for  medium-size  file  transfer  between  computers,  the  50K-bit  lines  and  packet  switching 
techniques  are  inefficient  for  low-speed  terminals  operating  in  full  duplex.  This  does  not  really  matter  since  traffic  on 
the  ARPANET  is  still  comparatively  light.  (Total  number  of  bits  moved  on  the  MFECC  net  exceeded  that  of  the 
ARPANET  already  in  1974.  ®  Nevertheless,  ARPANET  became  one  of  the  most  widely  studied  and  publicized 
networks. 
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TYMNET  developed  quite  differently.  Its  primary  purpose  was  to  interface  large  numbers  of  low-speed  terminals 
to  a  relatively  small  number  of  time-shared  computers  operated  by  Tymshare,  Inc.  Most  of  these  terminals  also 
required  full-duplex,  character-by-charaoter  interaction  with  the  machine.  The  inefficiencies  echoing  characters  back 
and  forth  were  solved  by  virtual  circuits  where  data  from  many  users  shared  the  same  physical  record  during 
transmission,  so  that  the  overhead  of  checksums  and  record  headers  could  be  divided  among  the  users.  Also,  by 
controlling  flow  node-to-node,  rather  than  circuit-end-to-circuit-end,  there  was  no  need  to  signal  back  a  message 
requesting  more  data.  This  efficient  operation  permitted  40  300-baud  interactive  terminals  to  be  served 
simultaneously  by  one  2,400  bps  line.  In  1972  the  National  Library  of  Medicine  put  the  first  non-Tytnshare  computer 
on  the  network.  Since  then,  this  Value  Added  Network  (VAN),  providing  automated  speed,  code,  and  protocol 
conversion  among  terminals  and  computers,  and  error  recovery  by  routing  records  around  inoperative  paths,  has 
become  one  of  the  largest  networks  with  more  than  25,000  daily  users  in  1979,  and  connections  to  overseas. 

TELENET  commercialized  the  ARPANET  concept  in  1975  and  was  approved  as  an  intern- ■■  ional  record  carrier. 
It  interconnects  to  TYMNET  and  TRANSPAC  in  Europe,  and  the  Canadian  CNCP  and  TOTS  networks.  These  three 
networks  will  continue  to  have  a  significant  role  for  ground  communication,  and  ihwreonnect  to  satellite  traffic  as 
required.  Existing  and  planned  cable-video  connections  are  expected  to  assume  a  significant  share  of  the  digital 
communications  market. 

The  interconnection  of  these  and  other  national  carriers  has  brought  about  an  international  worldwide  network. 
HUsenrath  at  the  National  Bureau  of  Standards  has  recently  surveyed  the  field  and  reports  250  different  numerical 
data  systems  of  which  53  deal  with  physical  and  chemical  properties.39  The  networks  are  TYMENT,  GTE  TELENET, 
CYBERNET,  GE  GEISCO,  and  CYPHERNET. 
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"Standards  for  Data  Exchange11 

The  opportunity  to  communicate  computer-based  information  in  large  quantities  introduced  the  necessity  of 
standards. 

For  communications  among  computers,  the  X.25  communications  protocol  is  probably  one  of  the  most 
remarkable  achievements  of  the  past  decade.  It  has  simplified  the  physical  interconnection  of  host  computers  over 
networks.  However,  the  interprocess  communications  among  computers  requires  either  a  translation  of  often 
incompatible  primitive  operators,  or  their  standardization.  Both  approaches  are  presently  in  progress.40*41  The 
Systems  and  Network  Architecture  Division  of  the  National  Bureau  of  Standards  has  the  mandate  to  study  the  issues 
involved  and  to  formulate  standards  in  digital  communications.  The  related  problems  and  opportunities  for  the  then 
Energy  Research  and  Development  Administration  (ERDA)  were  studied  and  reported  by  the  Working  Group  on 
Computer  Networking  of  the  Office  of  Engineering,  Mathematics,  and  Geosciences.42  Problems  and  solutions  of 
distributed  data  mangement  and  computer  networks  are  reviewed  each  year  in  conferences  at  the  Lawrence  Berkeley 
Laboratory.43 

For  communication  of  facts  among  dissimilar  databases,  several  standards  evolved  in  the  1970s: 

ANSI  239.2  -  1971  American  National  Standard  for  Bibliographic  Information  Interchange  on  Magnetic  Tape. 

ISO  2709  -  1973  Documentation  -  Format  for  Bibliographic  Information  Exchange  on  Magnetic  Tape. 

ISO  646  -  1973  7-Bit  Coded  Character  Set  for  Information  Processing  Interchange. 

ISO  962  -  1974  Information  Processing  -  Implementation  of  the  7-Bit  coded  Character  Set  and  Its  7-Bit  and 

8-Bit  Extensions  on  9-Track  12.7  mm  (0.5  in)  Magnetic  Tape. 

ISO  2022  -  1973  Code  Extension  Techniques  for  Use  with  the  ISO  7-Bit  Coded  Character  Set. 

ISO  2375  -  1974  Data  Processing  -  Procedure  for  Registration  of  Escape  Sequences. 

ISO/DIS  1001.2  -  197x  Information  Processing  -  Magnetic  Tape  Labelling  and  File  Structure  for  Information 

Interchange. 

UNISIST  SC74/WS/20  Reference  Manual  for  Machine-Readable  Bibliographic  Description. 

IAEA/INIS-9  (Rev  1)  INIS:  Magnetic  Tape  Specifications  and  Record  Format. 

TID-4581-R3  ERDA  Energy  Information  Data  Base:  Magnetic  Tape  Description. 

These  standards  deal  primarily  with  bibliographic  information.  The  growing  importance  of  numeric  data  in  the 
mid  1970s  led  to  the  "ERDA  Interlaboratory  Working  Group  for  Data  Exchange,"  which  studied  the  characteristics  of 
the  transmission  of  numeric  data.  Based  on  the  previously  established  standards  for  bibliographic  information,  the 
group  proposed  a  numeric  data  exchange  format,  officially  referred  to  as  the  proposal  for  an  "American  National 
Standard  Specification  for  an  Information  Interchange  Data  Description  File  Format.”  In  1978,  this  draft  became  the 
substance  for  the  ANSI  X3L5  committee,  which  includ’d  several  refinements  and  extensions  recommended  by 
international  reviewers.  The  CODATA  "Task  Group  for  Computer  Use"  endorsed  the  standard  and  asked  UNESCO  to 
have  it  disseminated  in  other  countries  for  potential  acceptance  as  an  internation  ISO  standard.  Formal  transmission 
of  the  final  recommendation  by  the  X3L5  committee  for  ANSI  confirmation  is  scheduled  for  July  1981.  It  may  take  an 
additional  6-9  months  before  the  standard  could  be  officially  accepted,  printed,  and  distributed. 

"The  International  System  of  Units  (SI)  for  Numeric  Facts" 

Shortly  after  World  War  n,  in  1954,  it  appeared  as  if  the  United  States  might  have  the  resolve  to  change  from  its 
English  system  of  units  of  measurements  to  the  metric  rationalized  and  coherent  system  of  units  based  on  the  four 
MKSA  (meter,  kilgram,  second,  ampere)  units,  plus  the  degree  Kelvin  as  the  unit  of  temperature,  and  the  candala  as 
the  unit  of  luminosity.  Although  the  United  States  participated  in  these  international  deliberations,  extension  of 
MKSA  and  the  adoption  of  the  International  System  of  Units  (SI)  did  not  receive  final  approval  before  1976.44 
Public  and  industrial  support  came  even  later.  The  advantages  of  only  one  unit  for  each  physical  quantity  a 
well-defined  set  of  unique  abbreviations  and  symbols,  and  the  retention  of  decimal  multiples  and  submultiples  of  the 
base  unit  for  each  physical  quantity,  have  made  it  now  possible  to  exchange  numeric  databases  with  less  difficulty. 

The  new  SI  system  of  measurement  is  being  adopted  throughout  the  world.  Its  details  are  published  and 
controlled  by  an  international  treaty  organization.  However,  for  a  number  of  reasons,  it  is  inevitable  that  a  few  other 
units  outside  the  system  be  used  with  it.  It  is  this  additional  use  of  non-SI  units  that  leads  to  controversy  and 
difference  between  standards  that  define  modem  metric  practice.  Since  a  wide  variety  of  metric  units  have  been  in 
use  for  years  in  various  parts  of  the  world,  it  is  natural  that  tradition  would  promote  use  of  these  old  units  in  formerly 
metric  countries.  For  this  reason,  many  European  and  international  standards  also  recognize  a  number  of  non-SI  units 
for  use.  To  protect  the  new  system  from  degradation,  and  to  cooperate  with  people  all  over  the  world,  the  SI  standard 
is  recommended  for  all  work  with  numeric  factual  data. 

During  the  transition  period,  where  technologists  are  still  accustomed  to  recognize  the  validity  of  material 
properties  by  their  remembered  values  in  farmer  units  of  measurement,  I  believe  that  database  management  systems 
for  numeric  data  should  offer  the  option  of  displaying  values  in  original  units  of  measurement,  SI  units,  or  both.  It  is  in 
this  manner  that  our  eyes  and  minds  can  be  trained  to  learn,  remember,  and  work  with  the  new  SI  units  with  growing 
confidence. 

"Standards  for  Fact  Attributes" 


Aecredition  of  facts,  especially  of  numerical  data  and  measurements,  requires  a  "shorthand”  notation  with 
attributes  which  a  scientist  or  technologiest  would  accept  in  place  of  the  original  descriptive  text.  The  minimum 
necemry  and  sufficient  set  of  such  attributes  has  been  discussed  in  the  literature.  The  table  below  is  a  representative 
list.45 


At  tribute*  of  Scientific  Data 


t  volue 

2  Uncertainty 

3  Units  of  Measurement 

4  Nor mo I i zq l » on 

5  Validity  Doma  ■  n 

6  Method  of  Measurement 

7  Conditions  and  Constraints 
8.  Type  of  Doto 

9  Source  of  Oota 
<0  Bibliographic  Reference 
t 1 .  Comments 

12  Proprietory  Status,  Class i f icot ion,  etc. 


One  of  the  reasons  why  numerical  databases  in  the  pure  sciences  may  not  have  come  into  greater  use  is  precisely 
the  lack  of  agreement  on  this  topic.  This  may  also  explain  why  a  recent  survey  carried  out  by  the  World  Federation  of 
Engineering  Organizations  still  lists  evaluated  numerical  property  data  banks  as  a  desirable,  yet  less  frequently  used 
resource.4®  A  number  of  relatively  low-level  efforts  are  in  progress  to  correct  this.47’4®  A  more  authenticated 
approach  is  needed.  David  Lide,  Director  of  NBS/OSRD,  emphasizes  in  a  recent  publication  in  Science  that 
standards  are  urgently  needed  for  critical  data  and  data  banks.®  Use  of  such  standards  for  the  reporting  of  numeric 
values  for  a  single  datum,  and/or  a  set  of  data,  in  the  different  disciplines,  could  serve  to  establish  confidence  for  both 
data  evaluators  and  users.  The  NBS  Computer  Institue  for  Science  and  Technology  and  the  Office  of  Standard 
Reference  Data  should  document  their  recommendation  in  the  existing  series  of  Federal  Information  Processing 
Standards  (FIPS). 

"The  Copyrighting  of  Facts" 

Computer  technology  confounded  the  interpretation  of  traditional  copyright  laws.  Unlike  conventional  printing 
techniques,  computer-based  systems  were  capable  of  storing,  processing,  retrieving,  transferring,  displaying,  and 
reproducing  works  of  authorship  with  ease.  In  addition,  any  one  of  the  above  processes  requires,  as  a  rule,  the  fixation 
of  more  than  one  copy,  (core  memory,  disk  storage,  CRT  display,  etc.)  and  could  thus  be  construed  as  a  copyright 
violation.  Also,  authors  and  owners  of  copyrights  of  works  argued  that  their  creations  should  be  patentable,  because  it 
is  the  idea,  procedure,  process,  system,  or  method  of  operation  that  should  be  protected,  rather  than  the  form  (i.e., 
source  program, -code)  in  which  it  is  described,  explained,  or  embodied.  Section  117  of  Public  Law  94-553  of  October 
1976,  therefore,  did  not  confront  these  difficult  issues  and  stipulated  that  computers  did  not  afford  the  owner  of 
copyright  in  a  work  any  greater  or  lesser  rights  with  regard  to  the  use  of  the  work  by  computers  than  those  afforded 
under  sections  of  the  common  copyright  law.  The  National  Commission  on  New  Technological  Uses  of  Copyrighted 
Works  was  established  a  few  months  later  to  study  the  matter.  It  issued  the  final  report  in  July  1978,  and 
recommended  that  Section  117  be  removed  and/or  clarified  to  provide  copyright  protection  for  computer  programs  and 
databases.5®*51  This  was  substantially  enacted  by  Public  Law  96-517  in  December  1980,  with  the  explicit  provision 
that  owners  of  a  software  product  can  copy,  or  authorize  copying  or  adapting,  the  work  without  infringement  on  ti- 
copy right  owner's  rights  if  this  action  either  constitutes  an  essential  step  in  using  the  program  with  a  machine,  .. 
serves  archival  purposes  only. 

The  recommendations  were  based  an  public  hearings  with  particular  attention  to  the  problems  of  copying 
computer-generated  images,  e.g.,  bibliographic  citations.  The  problems  of  copyrighting  numeric  data  were  not 
explicitly  addressed,  and  some  confusion  exists  as  to  whether  numeric  time  series  data  of  demographic  or 
environmental  observations,  and  of  measurements,  can  or  should  be  copyrighted.  Before  1  offer  an  opinion  as  a  user  of 
numeric  data,  I  would  like  to  make  the  following  observation. 

The  rearrangement  of  the  contents  of  a  copyrighted  book,  and  its  printing  and  marketing  in  modified  form  in 
competition  with  the  original  book,  would  probably  be  an  infringement  of  fair  use,  as  defined  in  Section  107  of  PL 
94-553.  A  similar  situation  can  be  inferred  for  numeric  factual  data.  Since  any  information  or  data  with  a  particular 
embodiment,  e.g.,  the  attributes  of  uncertainty,  precision,  normalization,  operating  conditions,  material  history,  etc., 
can  be  protected  by  copyright,  their  rearrangement  and  reproduction  in  whole  or  substantial  part  would  probably  also 
be  interpreted  as  an  infringement  when  it  affects  detrimentally  the  rewards  derived  or  expected  by  the  owner  from 
the  original  copyright.  This  observation,  if  substantiated,  would  have  significant  implications  on  the  three  processes 
by  which  factual  databases  in  science  and  technology  are  created;  collection  (aggregation),  derivation  (analysis),  and 
compilation  (evaluation).  In  each  case,  the  authors  and  owners  of  these  embodiments  can  request  and  receive 
copyright  protection  for  their  new  works.  Royalty  payments  to  the  owners  of  copyrights  for  the  contributing  works 
would  have  to  be  resolved  in  court,  if  not  previously  negotiated. 

Let  us  look  at  some  examples.  The  Department  of  Commerce  makes  available  magnetic  tapes  containing 
numeric  data  of  physical  and  chemical  properties,  evaluated  under  auspices  of  the  NBS  Office  of  Standard  Reference 
Data.  These  databases  are  being  sold  to  recover  costs,  and  are  copyrighted  on  behalf  of  the  United  States,  as 
mandated  in  Sections  5  and  6,  respectively,  of  the  Standard  Reference  Data  Act,  also  known  as  PL  90-396  (1968).  Of 
the  approximately  dozen  magnetic  tapes  now  available  from  the  Department  of  Commerce,  only  half  contain  material 
properties.  The  majority  of  the  numeric  data,  measured  and  reported  by  the  enormous,  tax-supported  research  and 
development  program  in  the  United  States,  are,  to  the  best  of  my  knowledge,  unprotected.  Yet,  they  form  the 
foundation  on  which  compilers  and  evaluators  in  the  United  States  and  abroad  build  their  topical  databases  which  then 
are  copyrighted  and  marketed  without  royalty  payments  to  the  experimentalists,  the  publishers  of  the  primary 
literature,  or  the  United  States  Government.  Compilations  of  numeric  data  are  thus  treated  similar  to  compilations  of 
citations  in  bibliographies.  The  difficulties  in  data  exchange  and  transfer  arise  only  after  such  numeric  compilations 
come  into  being  and  are  copyrighted  in  a  manner  that  is  not  in  the  interest  of  the  United  States. 

In  view  of  the  importance  of  factual  data  in  the  1980s,  the  copyright  issue  of  numeric  data,  and  the  issue  of 
transborder  flow,  will  undoubtedly  be  very  carefully  examined. 
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"The  Electronic  Office,  Video  Transmission,  and  Electronic  Mail" 

Much  has  been  written  on  this  topic.*3  it  is  clearly  big  business  when  we  are  given  the  capability  of 
interconnecting  word  processors  with  computers,  typesetters,  and  graphics  devices,  and  can  send  a  resultant  illustrated 
report  cross  country  in  seconds.  Productivity  and  creativity  are  increased.  Work  satisfaction  is  enhanced  also  as 
secretaries  and  electronic  word  specialists  now  create  camera-ready  copy  that  matches  thequality  previously  reserved 
for  publishing  houses. 

At  the  present  time  there  are  two  competing  yet  complementary  telecommunieat&n  and  computer-based 
technologies  available  for  transmitting,  storing,  retrieving  and  disseminating  large  volumes  of  information 
electronically.  One  of  these,  the  Videotex  (Viewdata)  concept,  exemplified  by  such  systems  as  Antiope,  Presto'.,  and 
Telidon  aims  at  providing  general  information  to  a  mass  consumer  and  general  business  audience  and  is  in  an  early 
developmental  stage.  The  other,  exemplified  by  bibliographic  and  numeric  information  systems,  has  been  discussed 
before.’3 


ELECTRONIC  COMMUNICATIONS  ARE  LINKING; 
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On-line  Informetion  Centert: 

DOE/RECON 

NASA/RECON 


The  Electronic  Office 


. . .  in  the  nation  and  across  continents 


Computer  Centert 


Video  image  and  video  text  transmissions  require  wide-band  communications.  These  can  now  be  realized  locally, 
but  will  not  be  possible  cross-country  at  reasonable  cost  before  sharing  of  satellite  communications  becomes 
commonplace.  (One  256K-bit  picture  requires  at  least  3.6,  minutes  for  transmission  at  common  1200  bps  speeds.)  But, 
more  than  50,000  pictures  can  be  stored  on  a  $4,000  video  tape  or  video  disc  machine  when  connected  to  an  intelligent 
terminal  and  the  appropriate  AD  and  DA  converters. 

TO  PARI*.  BRAZIL  LONOOR. 


TYPICAL  INTERlORNtCTlOR  OT  TTRUINALS  LOCAL  AMO  A l MOT t 
PROCESSORS  AMO  LOCAL  AMO  REMOTE  SERVICES  -Cl 


Electronic  mail,  usually  understood  as  an  expedient  means  of  sending  and  receiving  typed  messages  and  reports, 
is  going  to  be  augmented  by  voice  message  systems.  Voice  input,  although  now  available  with  limitations,  will  take 
longer  to  implement  but  is  certainly  going  to  become  one  of  the  modes  of  man-machine  communication,  together  with 
touch-screen  command  selection  as  marketed,  e.g.,  by  Control  Data  Corporation  in  their  PLATO  system. 

The  conversion  of  video  text  into  ASCII  text  by  means  of  Optical  Character  Reader  techniques  is  still  to  come. 
But,  when  it  takes  place  -  -  and  it  is  technically  feasible  now  -  -  we  will  experience  a  total  integration  of: 


Speech  »  ASCII  Text  e--  -»  Video  Text 


The  translation  of  printed  text  into  speech  is  already  being  offered  by  the  Kurzweil  Reading  Machine  for  the  Blind.** 


Communications  in  the  1980s  will  require  protection  from  eavesdropping  and  abuse.  It  is  against  the  law  to  tap  a 
telephone  line.  However,  the  upcoming  swarm  of  communication  satellites  which  is  starting  to  blanket  large 
geographic  territories  with  data  and  video  transmissions,  virtually  invites  anyone  with  a  parabolic  roof  antenna  to 
"listen"  in.  User  identifications  and  passwords,  if  transmitted  in  clear  form,  could  be  compromised.  Governments, 
businesses,  and  banks  have  long  ago  learned  to  protect  their  interests  by  encoded  transmissions.  The  flurry  of  sales  of 
$2K  to  $15K  roof  antennas  to  harvest  the  multichannel  free  broadcasting  of  television  programs  will  probably  be 
short-lived. 

Future  users  of  satellite  communications  in  science  and  technology  will  have  the  advantage  of  being  able  to 
chose  from  a  variety  of  proven  hardware/software  combinations  to  protect  their  interests.  I  would  like  to  draw  your 
attention  primarily  to  the  public-key  cryptosystems  that  have  been  proposed  toward  the  end  of  the  1970s  and  are  likely 
to  have  a  significant  role  in  the  coming  years.  Their  security  emphasis  has  changed  from  statistical  uncertainty  to 
computational  complexity.  They  have  developed  from  conventional  private-key  cryptosystems  to  public-* ey 
cryptosystems,  providing  instant  privacy  and  two-way  authentication.  Some  of  the  essential  observations,  as 
summarized  by  Abraham  Lempel,  are:55 

Cryptology  and  the  computational  power  of  electronic  computers,  once  mainly  concerned  with  the  making  and 
breaking  of  secure  military  and  diplomatic  communications,  have  now  become  a  major  concern  of  the  public  at  large. 
In  the  age  of  ever-growing  computer  data  banks  and  electronic  fund  transfer,  one  cannot  overestimate  the  importance 
of  encryption  schemes  which  provide  adequate  protection  against  unauthorized,  often  remote,  access  to  stored  data, 
render  the  data  unintelligible  to  unauthorized  listeners  over  a  publicly  accessible  communications  link,  and  incorporate 
a  digital  signature  which  can  serve  as  a  reliable  two-way  authentication.  These  formidable  goals  were  set  up  to  satisfy 
real  market  demands,  answered  in  part  by  the  Data  Encryption  Standard  (DES).  This  is  the  official  NBS  scheme  to  be 
used  by  Federal  departments  and  agencies,  as  well  as  others,  for  the  cryptographic  protection  of  computer  data. 
Those  critical  of  the  DES  argued  that  computers  in  the  early  1980s,  rather  than  toward  the  end  of  the  decade,  would 
have  the  power  to  break  the  DES  code  in  a  day.  Alternate  schemes  have,  therefore,  become  more  attractive. 

The  concept  of  public-key  cryptosystems,  introduced  in  1976  by  Diffie  and  Heilman,  envisioned  a  system  for 
private  communication  that  employs  a  public  directory  in  which  each  subscriber  places  a  procedure  E  to  be  used  by 
other  subscribers  for  the  encryption  of  their  messages  addressed  to  him,  while  keeping  secret  his  corresponding 
decryption  procedure  D.  The  existence  of  such  a  system  would  enable  instant  secure  communication  between 
subscribers  who  have  never  met  or  communicated  before.  For  example,  if  subscriber  A  wants  to  send  a  private 
message  M  to  subscriber  B,  he  looks  up  Er  in  the  directory  under  B,  and  transmits  C  =  EB(M)  in  the  open.  Only  B 
can  decrypt  C  by  applying  his  secret  Dg  to  C. 

One  of  the  major  shortcomings  of  currently  practiced  cryptography -- the  DES,  as  well  as  the  new  public 
schemes --is  the  lack  of  proof  that  any  of  these  schemes  are,  indeed,  as  hard  to  break  as  they  are  claimed  to  be. 
Nevertheless,  encoding  and  decoding  of  factual  data  will  necessarily  become  a  way  of  life  in  the  1980s. 

"The  User  Interface" 

From  the  preceding  review  and  projections,  it  becomes  apparent  that  modern  technology  can  link  virtually  any 
dissimilar  pieces  of  hardware  *nd  software  into  integrated  systems  of  higher  purpose.  Differences  in  format  and 
standards  cannot  be  avoided  and  do  contribute  to  less  efficient  operation  of  the  whole.  But,  they  really  do  not  pose  a 
lasting  hindrance.  Economic  pressu  "3  cause  them  to  conform.  The  essential  key  required  to  unlock  the  enormous 
potential  of  stored  information  ar.d  factual  data  lies  in  the  user  interface.  It  is  the  mediator  between  man  and 
machine.  It  makes  the  formidable  aggregate  of  powerful  computers  and  communications  appear  to  be  human,  and  it 
imbues  us  with  qualities  of  exactness  and  precision  more  likely  to  be  expected  from  a  machine. 


Different  methodologies  were  employed  in  the  past  decade  to  accomplish  this  goal  of  translating  English-like 
logical  requests  into  machine  instructions,  and  vice  versa.  In  moat  cases,  the  interface  has  been  tailored  to  serve  well 
one  particular  Data  Management  System  (DMS),  Data  Base  Management  System  (DBMS),  or  some  related  analysis  and 
graphics  package.  Although  modular  in  nature,  interfaces  and  their  man-machine  intercommunications  usually  became 
an  integral  part  of  the  programs  they  served.  Thus  they  were  seldom  capable  of  being  extended  to  other  systems,  or 
flexible  enoigh  to  accept  the  never-ending  demands  of  an  active  user  community.  (DIALOG  today  is  virtually  the 
same  as  invented  more  than  a  decade  ago.)  References  to  one  or  the  other  approach  can  be  found  in  publications  and 
proceedings. 

Here,  I  would  like  to  report  on  the  performance  of  a  unique  interface,  the  META-MACHINE,5®  designed  and 
developed  by  LLNL  in  collaboration  with  Control  Data  Corporation.  It  serves  as  an  extensible,  flexible,  and  practical 
interface  for  the  integrated  Technology  Information  System  described  toward  the  end  of  this  report.  It  translates 
pragmatic  English  commands  into  meta  instructions  far  an  open-ended  number  of  programs  which  it  controls.  It  can  be 
readily  adapted  to  languages  other  than  English. 
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The  META-MACHINE  is  fully  self-guiding  and  supports  interactive  text  and  data  retrieval,  interactive  modeling, 
electronic  mail,  and  networking.  It  is  the  central  controller  for  all  man-machine  and  machine-machine 
communications.  It  is  currently  installed  on  a  PDP-ll/70  machine  at  LLNL,  using  INGRES  as  the  relational  database 
management  system,  and  UNIX  as  the  operating  system  (2.25  M  bytes  of  core  and  1  B  bytes  of  disk  storage).®' 
However,  INGRES  and  other  subservient  programs  are  invisible  to  the  user.  Unlike  in  other  interfaces,  all 
man-machine  communications  are  deposited  in  an  tNGRES  relational  database  and  can  therefore  be  changed  in  real 
time,  online,  without  recompiling. 


INTEGRATED  INFORMATION  SYSTEM 
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META  MACHINE  [SYMBOLIC  EXAMPLE  OF  OPERATION) _ (3 
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The  META-MACHINE  simulates  in  software  a  pseudo-computer  where  sequentiid  instructions  are  retrieved  from 
a  5-domain  program  relation  table.  These  domains,  consisting  of  a  primary  address,  a  state  function  code,  a 
forwarding  address,  tui  execution  string,  and  a  functional  clustering  attribute,  can  be  arranged  and  manipulated  by  the 
System  Administrator  with  standard  INGRES  database  management  commands.  Meta  instructions  in  the  execution 
string  contain  prompting  statements  to  be  forwarded  to  a  user,  confirmations,  command  strings  for  the  underlying 
database  management  system,  or  instructions  for  other  programs. 

The  user  is  prompted  by  a  uniform  command  language.  His  requests  are  matched  agfunst  the  available  options  in 
the  hashed  primary  address  domain  of  the  T1S  program  relation,  and  are  exchanged  and  assembled  into  corollary 
instruction  strings  for  whatever  program  is  to  be  executed.  The  transmitted  of  the  prompting  strings  to  the 
appropriate  user  at  a  terminal,  or  the  transfer  of  an  instruction  stack  to  a  program,  is  carried  out  by  a  40,000-byte 
program  residing  in  core.  The  17  state  functions  determine  the  destination  and  type  of  string  delivery.  The  following 
figures  illustrate  this  functional  procedure.  During  man-machine  interaction  with  the  user,  while  the  command 
execution  strings  are  being  assembled,  partial  transmission  to  the  target  program,  local  or  remote,  can  be  made  in 
anticipation  of  the  full  command  string.  In  this  manner,  and  by  anticipating  permissible  attributes  and  parameters 
through  the  functional  command  cluster  attribute,  considerable  speed  can  be  gained. 

The  META-MACHINE  approach  uses  one  data  management  technique  for  access  to  all  information,  data,  and 
executive  capabilities.  This  includes  control  of  access  rights  by  user  for  each  of  the  major  databases,  the  individual 
relations  within  each  database,  the  user  commands,  the  preformatted  reports  and  graphical  displays,  and  especially  the 
information  about  other  centers  to  which  it  automatically  connects. 

The  flexibility  of  the  META-MACHINE  has  been  demonstrated  by  the  successful  integration  of  bibliographic, 
numeric,  project-oriented,  administrative,  and  budgetary  information  or  data  files  in  one  system.  The  report  writer, 
exceeding  the  ANSI  COBOL74  capabilities,  is  also  driven  by  instructions  from  the  META-MACHINE.  Using  this  unified 
technique,  the  report  writer  was  written  in  2-3  man-weeks.  Delivery  of  a  report  writer  programmed  by  conventional 
techniques  had  been  estimated  to  require  six  months  and  a  minimum  of  $30,000. 

The  extensibility  of  the  META-MACHINE  was  demonstrated  when  in  three  days  we  converted  an  electric  car 
performance  prediction  model  from  batch  to  interactive  use.  This  model,  developed  by  the  Transportation  Systems 
Research  program  at  LLNL  for  the  DOE  Office  of  Energy  Systems  Research,  prompts  the  user  for  different  scenarios: 
vehicle  type,  battery  selection,  time  period,  driving  cycle,  etc.,  and  compares  the  calculated  result  with  the 
performance  of  vehicles  equipped  with  internal  combustion  engines.  Fifteen  other  models  are  now  in  use  for  different 
vehicles  such  as  hybrids,  using  flywheels,  batteries,  and  other  energy  storage  methods.  The  user  of  the  interactive 
models  can  enter  his  own  technical  values  if  he  wishes  to  explore  conditions  that  are  not  part  of  the  prepared  options. 
Validation  of  these  models  is  greatly  simplified  by  having  the  input  data  stored  in  individual  database  relations  from 
which  they  can  be  extracted,  manipulated,  compared,  and  plotted.  Results  of  these  calculations  can  be  saved  for 
parameter  studies  and  time-series  analyses. 


Multilingual  interaction  with  users  is  possible  by  simple  translation  of  the  user  command  language,  in  the 
program  database,  into  languages  other  than  English.  The  corollary  instruction  strings  to  the  underlying  database 
management  system,  or  to  other  programs,  remain  the  same. 

Different  command  langua;  --.s  can  be  activated  by  the  extensible  TIS  database  language  by  translating  the 
instructions  from  the  available  INGRES  command  language  into  those  of  a  different,  local  or  remote,  database 
management  system  or  execution  program.  Where  additional  options  are  offered  by  a  new  system,  additional  prompts 
are  appended  to  the  program  relation  with  the  appropriate  addresses  and  states  in  the  first  domains.  In  this  case,  the 
user  interaction  strings  remain  the  same. 

Electronic  mail  has  been  augmented  by  integrating  the  local  and  ARPANET  directories  into  one  mail  system  with 
many  options.  Automatic  file  transfer  between  word  processors  and  TIS  has  been  used  since  1980. 

Networking  became  operational  with  the  incorporation  of  the  NBS  Network  Access  Machine  software  into  the 
TIS  system.  Fully  automated  and  transparent  connections  are  made  to  several  host  machines  over  the  ARPANET  and 
by  telephone  dial-out  These,  include  DOE/RECON,  NASA/RECON,  MACSYMA  at  MIT,  NIC  at  SRI,  LBL-UNIX,  SERI, 
LBL-VAX,  and  the  DOE  Alternative  Fuels  Center  in  Oklahoma,  among  22  other  systems.  To  connect  to  DOE/RECON, 
for  example,  the  user  simply  specifies  the  target  by  requesting:  connect  doe  recon.  The  progress  of  the  connection  is 
displayed  and  requires  between  7  and  45  seconds,  depending  on  whether  ARPANET  or  telephone  lines  are  used.  In 
some  cases  we  activate  at  the  remote  host  the  needed  resources  immediately,  in  which  case  the  target  name  is  made 
identical  to  the  resource,  e.g.:  connect  macsyma. 

At  the  present  time,  we  offer  over  64  major  information  resources  that  describe  in  a  hierarchical  manner 
material  properties  for  energy  storage  materials,  technology  characterization  data,  systems  data  that  are  aggregates 
of  components  used  in  conjunction  with  energy  storage  applications,  and  a  number  of  interactive  models.  Econometric 
files  were  added  to  permit  market  penetration  studies.  The  results  can  be  viewed  in  over  300  predefined  tabular  and 
graphical  display  formats. 

In  summary,  the  META-MACHINE  and  its  implementation  for  the  Technology  Information  System  offers 
administrators  and  project  staff  a  central  focal  point  for  communication  and  information  management,  more  fully 
explained  in  its  capabilities  later  on  in  this  report.  Conceptually,  it  functions  as  the  basis  for  a  stand-alone,  intelligent 
gateway  computer  with  capabilities  of  linking  any  user  to  local  or  remote  sources  of  information,  permitting  him  to 
extract  and  analyze  the  result:,  and  to  share  them  with  others.  To  accommodate  the  larger  traffic  in  communications 
and  personal  information  mandgen..nt,  we  are  planning  to  replace  INGRES  by  installation  of  the  directly  compatible 
Intelligent  Database  Machine,  1DM-500  by  Britton  Lee,  Inc.®8  This  is  also  expected  to  off-load  the  CPU  of  the 
PDP-U/70  and  to  improve  throughput  by  t  least  one  order  of  magnitude. 

"Extraction  of  Intelligence" 


Facts,  by  themselves,  do  not  convey  understanding  or  knowledge.  Their  comparison  with  past  experience, 
correlation  with  other  data  and/or  predictions,  and  display  in  multidimensional  color  images,  however,  can  be 
meaningful.  As  we  are  approaching  the  1980s  and  are  faced  with  a  very  large  volume  of  factual  data,  it  becomes 
increasingly  necessary  that  we  filter  the  data  and  extract  new  insight. 


In  the  full-text  retrieval  service,  NTIS  has  initiated  a  new  type  of  automated  indexing  of  citations  with  their 
Total  Microfiche  SDI  Service,  also  known  as  "Selected  Research  in  Microfiche"  (SRIM).  Since  the  fiche  are  being 
updated  quarterly,  the  subscriber  has  always  a  reasonably  n>-to-date  and  comprehensive  listing  of  full-length 
publications  in  his  field  of  interest,  indexed  by  author,  subject,  corporate  author,  contract  grant  number,  accession 
report,  and  title  —  a  really  remarkable  service  making  good  use  of  the  photographic  storage  medium.  This  should 
serve  many  users  well  in  the  1980s. 


In  the  bibliographic  online  information  services,  both  Federal  and  commercial  citations  are  still  being  delivered 
to  the  end-user  "en  masse"  as  raw  material.  It  was  pointed  out  already  in  1971  that  much  could  be  done  with  the 
retrieved  results  while  they  were  still  on  the  host  computer.  These,  and  related  aspects,  were  discussed  in  some  detail 
at  the  forum  on  Interactive  Bibliographic  Systems  held  at  Gaithersburg,  MD,  October  4-5,  1971,  under  the 
chairmanship  of  Charles  Meadow  (cf.  pp.  173-174): 

o  Indexes  by  author,  subject,  category,  etc. 
o  Statistics  by  country,  organization,  etc. 

o  Cross-correlations  of  data  elements 

o  Graphical  display  of  results 

o  Text  analysis 
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Of  these,  the  first  is  always  used  in  books.  Indexes  are  sorely  needed  in  bibliographic  online  information 
retrieval.  At  LLNL,  we  use  the  capabilities  of  the  integrated  Technology  Information  System,  and  offer  online, 
interactive  commands  by  which  the  above  options  can  be  carried  out  by  the  user  immediately  after  completion  of  a 
search.  The  technique  has  been  adapted  especially  to  DOE/RECON  and  will  be  extended  to  other  databases  and 
information  systems  as  required  and  where  permissible.  The  user  can  thus  create,  review  online,  and  annotate  his 
personal  or  programmatic  bibliography  and/or  library  system. 


Text  analysis,  although  not  new,  has  some  fascination  because  it  permits  us  to  recognize  significant  migration  or 
cross-fertilization  of  new  ideas  in  R&D  work.  Let  us  take  for  example  the  laser  field.  When  one  creates  an  authority 
list  of  all  single  and  multiterm  expressions  derived  from  titles  and  abstracts,  one  arrives  at  a  reasonably  stable,  slowly 
changing  body  of  descrptive  terms.  Newly  appearing  terms  can  easily  be  set  aside  and  used  to  mark  citations  for 
closer  inspection.  It  is  thus  possible  to  find  the  citation  where  laser  beams  were  first  used  to  weld  the  retina  in  an 
eye.  In  other  words,  one  can  filter  out  those  citations  that  somehow  do  not  follow  the  common  pattern  of  previous 
descriptive  indexing  or  word  usage.  They  contain  either  typographical  errors  or,  potentially,  literary  pearls.  Other 
examples  quickly  come  to  mind.  The  techniques  for  doing  this  type  of  analysis  have  been  known  since  the  time  when 
inverted  tables,  or  secondary  indexes  were  introduced  for  machine-aided  look-ip  of  facts.  They  could  be  used  in  the 
1980s  as  a  new  means  of  SDI  service,  signaling  unusual  and  different  publications. 

I  would  like  to  show  an  additional  example  of  higher  intelligence  recently  extracted  by  Japanese  technologists 
from  a  bibliography  on  nuclear  criticality  experiments  published  at  LLNL  last  year.59  This  publication  was  released 
in  three  volumes  with  accurate  computer-generated  concordances.  It  served  as  the  basis  for  graphical  extraction  of 
higher  intelligence,  e.g.: 
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One  of  the  complex  pie  charts  shows  the  number  of  reports  published  from  each  organization.  Another  depicts 
the  type  of  experiments  carried  out  at  the  different  facilities,  and  finally,  cumulative  totals  of  experiments  with 
different  fuel  enrichments  are  shown  as  a  function  of  time.  Although  these  illustrations  were  probably  carried  out  by 
manual  inspection  of  the  factual  data  contained  in  the  bibliography,  computer  programs  could  be  written  to  do  the 
same  routinely. 
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The  extraction  of  higher  intelligence  from  numeric/structured  data  files  is  common  practice.  Numbers  lend 
themselves  directly  to  statistical  analysis  and  graphical  display.  General  and  specialized  programs  are  offered 
commercially  or  are  under  development  at  universities  and  Federal  organizations.  But,  what  is  needed  is  an  adaptation 
of  some  of  these  powerful  tools  for  general  use  in  a  self-guided  manner.  As  mentioned  earlier,  in  the  field  of  standard 
retrieval  of  bibliographic  information,  we  see  a  shift  from  the  information  specialist  as  an  intermediary  to  the 
end-consumer.  The  analysis  and  graphical  display  programs,  however,  are  more  complex  and  will  require  in  the  1980s 
information  specialists  well  familiar'  with  the  manipulation  of  numerical  data  and  mathematics.  The  direct  creation  of 
electronic  visuals  (35mm  slides,  viewgraphs,  films,  etc.)  by  computer  is  the  newest  exploding  technology  for  the  1980s. 

An  excellent  example  of  what  can  be  done  for  technologists  with  advanced  tools  is  given  by  the  Integrated 
Programs  for  Aerospace-Vehicle  Design  (IPAD).  Early  stages  of  the  software  are  in  use  at  Boeing  and  several  other 
aerospace  firms.  The  system  manages  the  total  flow  of  information  and  data  for  engineering  and  manufacturing.  The 
National  Aeronautics  and  Space  Administration  (NASA)  is  making  major  strides  by  sponsoring  this  development. 

Another  example  is  tound  in  color  graphics  programs.  Beautiful  illustrations  of  double-helical  DNA  molecules 
and  those  of  the  Tomato  Bushy  Stunt  Virus  have  graced  the  front  covers  of  professional  journals.60  They  indicate 
the  pictorial  power  in  store  for  us  in  years  to  come.  At  the  present  time,  these  programs  are  still  specialized  and  not 
suitable  for  direct  linking  with  databases.  Illustrations  are  usually  produced  manually  by  one-at-a-time  adaptations  to 
recently  measured  parameters,  or  extractions  from  an  existing  database.  What  is  needed  here,  too,  is  a  user-oriented 
tool  by  which  anyone  with  access  to  the  CIS  databases,  for  example,  could  search,  retrieve,  display,  and  zoom  in  on  the 
molecular  structures  of  interest  to  him.  A  number  of  these  sophisticated  tools  are  in  the  public  domain  and  will 
hopefully  become  available  in  simplified  form  as  well. 


Cutaway  view  showing  inner  surface  of  protein  coat  of  Tomato  Bushy  Stunt  Virus,  as  represented  by  computer  graphics 
system  under  development  at  National  Resource  for  Computation  in  Chemistry  (NRCC)  and  LLNL.  Each  large  sphere 
represents  one  protein  molecule,  small  spheres  are  linking  arms"  of  the  protein  molecules.  Close-i4>  view  focuses  on 
three  of  the  protein  molecules  in  the  TBS  virus,  showing  arms  that  link  molecules. 


I  would  like  to  conclude  this  section  with  a  remark  about  pattern  recognition.  This  powerful  fact  retrieval 
technique  has  been  used  primarily  with  numeric  data,  but  there  is  no  reason  why  it  could  not  be  used  with  structured, 
coded  data  or  even  with  text.  The  following  examples  may  serve  this  purpose: 

The  San  Diego  Police  Department  received  a  grant  in  1975  from  the  Law  Enforcement  Assistant  Administration 
(ARJIS)  to  plan  for  a  computerized  data  processing  system  that  could  serve  all  of  San  Diego  County.  One  of  the  goals 
was  the  implementation  of  an  automated  crime  analysis  capability.  The  Lawrence  Livermore  National  Laboratory  was 
requested  to  assist  ARJIS  project  personnel  under  auspices  of  the  National  Technology  Transfer  Program.  The  results 
were  impressive.  Pattern  recognition  with  the  LLNL-developed  PATTER  program  could  identify  correctly  those 
crimes  where  the  success  of  investigation  and  conviction  was  highest.  Such  an  approach  can  be  used  for  similar 
multivariate  problems  that  are  difficult  to  solve  by  statistical  or  conventional  means.  In  another  case,  the  same 
pattern  recognition  program  was  applied  successfully  to  the  primary  data  describing  elements  on  a  wall  chart.  It 
predicted  correctly  the  acidic,  basic,  and  graduated  amphoteric  characteristics  of  the  elements.  Pattern  recognition 
has  particular  importance  for  work  of  unusual  complexity  and  offers  powerful  solutions  to  problems  with  limited 
manpower  and  budgets. 
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MODELING 


•  Theory  i(  often  insufficient 

•  Experimental  measurements 

•  Insufficient  dete  for  statistical  modeling 

Pattern  recognition  tachnlquat  can  be  used  to  oonetmet 
•amNmpericel  models 


References :  C.F.  Bender  et  els. . 

Pattern  Recognition,  Jour.  A«.  Chee.  Soc.  94,  563?,  197? 
Pattern  Recognition  6  Crime  Analysis,  UCID-17224,  1976. 


"The  Intelligent  Gateway  Computer" 


Local  office  networks  are  starting  to  be  interconnected  with  computers,  remote  information  centers,  and  other 
resources.  The  results  are  not  always  simple  or  elegant,  but  they  work  and  are  prompted  by  the  availability  of 
whic!f.  are  ref!'°v,in8.  distance-dependent  communication  costs.  These  developments  magnify 
the  opportunities  for  expanding  available  information  resources  and  computer  power  enormously.  However,  to 


MANUAL  MODE  OF  OPERATION 


circumvent  dissimilarities  among  different  resources  and  communication  protocols,  a  coordinated  approach  is  needed. 
Ideally,  it  should  permit  a  group  of  users  to  interact  with  each  other,  and  with  the  rest  of  the  world,  in  a 
cost-effective  and  practical  manner. 
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The  trend  toward  this  direction  has  been  apparent  for  some  time.  Computer  terminals  developed  gradually  from 
mere  keyboards  to  being  imbued  with  microprocessor  brains  and  memories.  The  communication  capabilities  of  these 
intelligent  terminals,  however,  are  usually  Intended  to  connect  only  to  one,  or  to  a  few,  remote  computers  or 
resources.  Terminals  of  this  type  are  offered  for  $15K  to  $25K,  where  some  would  be  considered  to  be  in  the  class 
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of  minicomputers.  In  most  cases,  the  manufacturers  provide  capabilities  that  are  upward  compatible  only  within  their 
line  of  hardware.  This  prompts  each  owner  and  user  of  such  an  intelligent  terminal  to  "upgrade"  with  even  greater 
investments  in  time  and  money  without  necessarily  coming  closer  to  a  more  flexible  and  automated  solution. 

Common  communication  gateway  computers  also  provide  only  a  partial  solution.  As  a  rule,  they  are  not  very 
intelligent.  They  connect  and  may  exert  some  control  and  perhaps  do  some  accounting,  but  they  do  not  provide 
translation  of  incompatible  protocols  among  electronic  text  processors;  they  do  not  permit  users  to  determine  their 
capabilities  or  allow  extraction  and  saving  of  information  and  data  from  other  sources. 

What  is  needed  is  a  stand-alone,  intelligent  gateway  computer.  It  should  contain  a  master  index  to  the  available 
resources,  connect  authorized  users  automatically,  translate  protocols  and  formats,  permit  the  aggregation  of 
reasonable  amounts  of  extracted  information  and  data,  and  offer  a  resident  library  of  software  tools  by  which  these 
data  can  be  post-processed,  analyzed,  and  displayed. 


An  Intelligent  Gateway  Computer 


Accan  Commands  Pott -processing 

control  &  date  library 


At  LLNL,  we  were  confronted  with  a  simitar  problem  in  1975  when  the  U.S.  Department  of  Transportation  asked 
us  to  study  the  concept  and  implementation  of  a  "Transaction  Controller"  that  would  permit  their  analysts  to  interact 
with  some  26  different  computer  centers  where  the  statistics  of  passenger  and  cargo  traffic  by  ship,  air,  trains,  and 
trucks  are  kept.  Our  work  evolved  into  the  design  and  implementation  of  the  META-MACHINE  user  interface  for  the 
Technology  Information  System,  described  previously.  Presently,  we  are  generalizing  our  experience  in  this  field  with 
the  concept  of  a  stand-alone,  portable  system. 

The  "Intelligent  Gateway  Computers"  would  serve  a  group  of  users  who  could  retain  their  old  terminals  since  the 
intelligence  can  now  be  made  to  reside  in  a  time-shared,  interactive  minicomputer.  A  possible  configuration  might  be 
a  PDP-VAX-750  machine,  coupled  with  an  IDM-500  back-end  database  machine,  and  a  versatile  communications  front 
for  automated  dial-out,  dial-in,  and  network  access.  Our  development  work  at  LLNL  points  toward  this  direction  and 
uses  the  META-MACHINE  as  the  extensible  and  flexible  interface: 


CONCEPTUAL  VIEW  OF  A  FUTURE  "TIS"  INTELLIGENT 
GATEWAY  COMPUTER 


4.  FACT  RETRIEVAL  IN  THREE  DIMENSIONS  AS  A  FUNCTION  OF  TIME 


This  is  probably  the  ultimate  goal  of  information  management.  But,  as  with  speech  and  books,  the  technology 
advances  first  to  a  state  where  it  can  faithfully  store  and  duplicate  the  information.  Management  of  information,  and 
extraction  of  higher  levels  of  intelligence,  follows  later  on.  Here,  I  would  like  to  draw  your  attention  to  two  unusual 
three-dimensional  display  methods,  those  of  holography  and  stero-vision.  Both  were  recently  reviewed  by  Dr.  Donald 
L.  Vickers  who  is  leading  the  graphics  groig>  at  the  Lawrence  Livermore  National  Laboratory.  The  technical  details 
and  illustrations  are  extracts  of  his  presentation81  and  should  give  you  an  immediate  insight  to  the  potential  of  this 
Upcoming  technology  for  your  future  work. 

"Crystallographers,  for  example,  are  encumbered  with  more  than  their  share  of  the  age-old-problem  of  trying  to 
perceive  three-dimensional  information.  Consequently,  they  are  more  painfully  aware  than  most  of  the  irony  of  living 
in  a  three-dimensional  world  while  having  to  communicate  with  fewer  than  three  dimensions.  Even  the  computer,  with 
its  great  speed  and  ability  to  solve  problems  based  on  data  with  many  dimensions,  usually  ignores  the  potential  of 
communicating  with  the  user  in  at  least  three  dimensions  when  it  comes  to  graphically  presenting  the  results. 
Three-dimensional,  computer-stored  information  nearly  always  is  transformed  into  two  dimensions  and  then  plotted  on 
paper,  film,  or  the  face  of  a  cathode  ray  tube  (CRT).  There  are,  however,  several  displays  on  which  three-dimensional 
computer  data  actually  appear  three-dimensional  and  steroscopic.  Two  such  displays  are  the  integral  hologram  and  the 
head-mounted  display. 

"Integral  Holography11 

Integral  holography  is  several  steps  beyond  the  holography  most  are  familiar  with.  Many  scientists  have  seen  a 
standard  transmission  hologram  and  the  three-dimensional  image  which  it  forms  when  illuminated  with  the 
monochromatic  light  of  a  laser.  Many  have  also  seen  the  cylinder  hologram,  a  cylinder  of  processed  holographic  film 
which,  when  illuminated  with  a  laser,  recreates  a  three-dimensional  image  inside  the  cylinder.  As  the  cylinder  is 
rotated,  the  object  inside  also  appears  to  rotate  and,  thus,  one  may  view  both  the  back  and  front  of  the  holographic 
image.  Now,  try  to  imagine  a  cylinder  hologram  that  produces  a  holographic  image  which  not  only  rotates  as  the 
cylinder  is  rotated,  but  which  also  moves  or  deforms --this  is  an  integral  hologram.  (Refer  to  figures  on  the  next  page. 

But  even  more  amazing  than  a  time-varying  holographic  image  is  that  the  source  of  illumination  for  viewing  is  an 
ordinary  incandescent  light  bulb  and  not  a  laser.  An  integral  hologram  is  not  made  from  a  single  exposure  as  it  is  in  an 
ordinary  cylinder  nologram,  rather,  it  is  made  up  of  2160  individual  slit  holograms  which  are  integrated  by  the 
observer's  eye  to  form  what  appears  to  be  a  single  holographic  image.  The  2160  slit  holograms  are  made  from 
consecutive  frames  of  a  1080-frame  35-mm  black  and  white  movie  film  by  a  process  described  in  the  following 
paragraph.  The  holographic  film  containing  the  slit  holograms  is  taped  to  the  outside  of  a  plastic  cylinder  about  40  cm 
in  diameter  such  that  each  slit  hologram  subtends  1/6°  of  arc  on  the  cylinder.  As  the  cylinder  is  rotated,  successive 
slit  holograms  come  into  view,  giving  to  the  holographic  image  an  illusion  of  motion  much  like  that  in  the  35-mm 
movie  from  which  the  integral  hologram  was  made.  The  1080  frames  correspond  to  a  45-second  movie  shown  at  a  rate 
of  24  frames  per  second. 
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Schematic  view  from  above  ihowta*  bow  various  slit  holograms,  though  separate  in  time.  Schematic  diagram  of  the  main  elements  used  by  the  Mohiptex  Company 

form  a  single  "Integrated"  holographic  image.  When  the  cylinder  routes,  the  eye  does  not  notice  ^  integral  holograms 

any  separation  between  the  stit  boktp-um 


The  slit  holograms  are  24  cm  high  and  0.7  cm  wide  on  the  I.25-m-iong  holographic  film.  They  are  made  by 
shining  a  7-mW  HeNe  laser  through  one  frame  of  the  35-mm  movie  at  a  time,  passing  the  resultant  beam  through  a 
large  cylindrical  lens,  and  combining  it  with  a  reference  beam.  The  interference  patterns  thereby  generated  are 
captured  on  holographic  film  which  is  advanced  about  0.5  mm  between  exposures,  thus  causing  some  overlap  among 
adjacent  slit  holograms.  In  the  "standard"  grade  integral  hologram,  the  apparent  resolution  of  the  holographic  image  is 
improved  by  exposing  each  frame  of  the  35-mm  movie  twice.  This  is  why  the  1080  frames  of  the  35-mm  movie  result 
in  2160  slit  holograms.  A  coarser-looking  "proof"  grade  hologram  can  be  made  by  advancing  the  holographic  film  1  mm 
and  exposing  each  35-mm  frame  once. 

Two  factors  combine  to  allow  the  use  of  white  light  rather  than  laser  light  for  illumination  of  the  hologram. 
First,  a  cylindrical  lens  instead  of  a  diffusion  screen  is  used  in  the  exposing  process.  Second,  the  subject  of  the 
hologram  is  a  two-dimensional  piece  of  35-mm  film  and  does  not  require  a  laser  to  "unlock"  the  third  dimension.  In 
fact,  if  monochromatic  light  were  used  one  would  not  see  the  whole  image  but  just  a  small  hoop-shaped  band  of  it  and 
the  b and  would  move  tg>  and  down  showing  different  parts  of  the  holographic  image  as  one  raised  and  lowered  one's 
head.  As  it  is,  the  holographic  film  acts  as  a  diffraction  grating  for  the  illuminating  white  light  so  that  the 
holographic  image  one  sees  contains  the  colors  of  the  rainbow,  ranging  from  red  at  the  top  to  violet  at  the  bottom.  As 
one  moves  up  and  down  while  looking  at  the  holographic  image  the  rainbow  of  colors  appears  to  shift.  Two  restrictions 
are  imposed  on  the  nature  of  the  illuminatint  white  light;  the  bulb  must  be  unfrosted,  and  the  filament  must  be  as 
near  as  practical  to  a  single  vertical  line. 

To  date,  the  Lawrence  Livermore  National  Laboratory  (LLNL)  jus  produced  five  computer-generated  movies  of 
scientific  interest  from  which  integral  holograms  have  been  made.82  One  shows  s  nonrotating  disk  which,  as  the 
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cylinder  is  rotated,  warps  according  to  the  phase  change  in  the  beam  of  one  of  our  large  lasers.  Another  shows  a 
rotating  sheet  which  "grows  into  a  mountain,"  the  shape  and  altitude  of  which  indicate  the  density  of  X  rays  emitted 
from  a  laser-bombarded  target.  Two  of  the  five  holograms  are  of  direct  interest  to  crystallographers.  The  first  shows 
a  rotating  tetraglycine  molecule  which,  as  it  rotates,  changes  from  a  "ball-and-stick"  to  a  "filled-spaee"  model.  The 
second  represents  a  rotating  calcite  crystal  showing  first  the  unit  cell,  then  adding  a  clevage  rhombohedron,  and 
finally  showing  the  free  rotation  of  the  oxygen  atoms  about  the  carbon  atoms  within  the  carbonate  group,  as  it 
actually  occurs  at  extremely  high  temperatures.  Refer  to  figures  on  preceding  page. 

The  concepts  of  integral  holography  are  not  new.  Both  the  ideas  of  making  composite  holograms  and  of  making, 
from  a  collection  of  two-dimensional  perspectives  (movies),  holograms  which  can  be  viewed  with  white  light  have 
previously  been  published.  Also  in  the  literature  are  articles  about  how  to  make  full-color  holograms.  What  is  new  is 
the  working  combination  of  these  concepts  and  the  commercial  availability  of  the  integral  hologram  such  that  any 
interested  researcher  may  take  advantage  of  it. 

Applications  of  integral  holography  to  the  areas  of  advertising  and  publicity  are  obvious.  The  integral  hologram 
is  also  potentially  very  useful  to  both  scientific  education  and  research.  Imagine  the  benefit  of  showing  students  not 
just  a  rotating  three-dimensional  model  of  a  molecule,  but  a  rotating  three-dimensional  holographic  model  that  also 
shows  how  molecules  and  atoms  combine  during  a  chemical  reaction.  Already  integral  holograms  have  been  used  with 
resounding  success  as  visual  aids  for  technical  papers  in  the  area  of  chemistry,  crystallography,  and  computer  science. 
When  used  as  an  aid  to  help  the  researcher  better  understand  the  structure  of  some  molecule  or  crystal  he  is  working 
with,  the  integral  hologram  is  more  accurate,  more  versatile,  less  expensive,  and  requires  much  less  effort  to  create 
than  most  complicated  ball-and-stick  models. 

Up  to  now  the  integral  holograms  we  have  dealt  with  have  been  limited  to  360°  views.  (Actually  the 
tetraglycine  and  calcite  integral  holograms  may  be  considered  to  have  720°  of  view  since  the  molecules  rotate  twice 
for  every  rotation  of  the  holographic  cylinder.)  Researchers,  using  a  process  which  requires  a  monochromomatic  point 
source  for  illumination,®®  made  an  experimental  integral  hologram  on  a  piece  of  holographic  film  70  mm  wide  and 
about  7  m  long  which  they  spiraled  around  a  plastic  cylinder  and  were  thus  able  to  view  a  much  longer  movie 
sequence.  When  the  process  of  making  an  integral  hologram  from  a  movie  becomes  more  automated,  it  should  be 
possible  to  use  source  and  take-up  reels  which  move  a  full-length  movie's  worth  of  holographic  film  around  a  rotating 
plastic  viewing  cylinder. 

Fact  retrieval  by  means  of  integral  holography  for  most  of  us  may  still  be  some  time  in  the  future.  However,  we 
recognize  that  these  advanced  capabilities  are  coming  up  and  have  to  be  linked  with  the  data  in  a  user-oriented 
environment.  This  is  where  the  producers  of  data  and  information  specialists  come  in.  Currently,  there  is  probably 
only  one  place  in  the  world  with  the  facility  for  making  a  white-light  integral  hologram. 

"Stereo  Vision  of  Facts" 

We  just  learned  how  computer-generated  integral  holograms  permit  us  to  view  complex  data  as 
three-dimensional  movies.  But  we  remained  passive  observers  of  a  virtual,  tantalizing  imagery.  To  remember  these 
images,  to  explore  the  meaning  of  their  valleys  and  peaks,  we  still  would  have  to  use  our  minds  to  ponder  their 
implications  from  different  points  of  view.  Would  it  not  be  nice  and  informative  to  be  able  to  walk  into  the  data, 
being  able  to  touch  them,  to  "feel"  their  irregularities,  and  to  smcothen  them  by  hand.  We  may  have  done  some  of  it 
in  an  abstract  sense  mathematically,  but  must  have  envied  the  sculptor  who  can  project  the  images  of  his  inward  eye 
into  reality  and  mold  it  by  hand. 

The  technology  to  do  this  in  information  management  is  available.  It  has  been  explored  during  the  seventies.  I 
am  bringing  it  here  to  your  attention  because  technical  difficulties  and  costs  that  may  have  prevented  its  widespread 
application  in  earlier  years  have  decreased  with  the  advent  of  inexpensive,  powerful  microcomputers  that  feed  on  data 
from  larger  stores.  Stereo-vision  of  facts  is  made  possible  by  a  head-mounted,  three-dimensional  viewer.  It  was  first 
built  by  Dr.  Ivan  E.  Sutherland  at  Harvard  College.®’ 

The  display  itself  uses  refreshed  CRT  technology,  but,  unlike  most  CRT  displays,  this  one  is  worn  on  the  head 
like  a  pair  or  spectacles.  Pictures  drawn  by  a  computer  on  the  two  CRTs  are  presented  to  the  person  wearing  the 
spectacles,  the  observer,  as  a  virtual  image  which  appears  to  be  made  of  glowing  wire  and  is  superimposed  on  the 
observer's  field  of  view  in  such  a  way  that  it  seems  to  float  in  space.  The  viewer  and  a  virtual  map  of  the  United 
States  are  shown  on  the  next  page. 

The  computer-generated,  or  "synthetic,"  objects  can  be  programmed  to  remain  stationary  as  the  observer  walks 
around,  between,  or  even  through  them.  They  can  also  be  made  to  change  in  time  as  the  observer  stands  still  or  as  he 
evokes  a  response  by  his  body  motion  or  controls.  Both  a  wand  and  light-studded  gloves  have  been  used  to  reach  out 
and  interact  with  the  synthetic  objects,  allowing  an  observer  to  touch,  connect,  deform,  erase,  and  even  to  create 
them  by  "drawing"  in  space.  The  unique  three-dimensional  environment  created  by  the  head-mounted  display  comes 
from  the  smooth  teamwork  of  many  special  pieces  of  hardware.  Refer  to  Figures. 

The  head  set  which  actually  presents  the  synthetic  object(s)  to  an  observer  has  two  2-cm-dlameter  CRTs 
mounted  where  the  temple  pieces  would  be  located  in  normal  eyeglasses.  A  picture  drawn  on  the  left  CRT,  for 
instance,  is  reflected  off  a  mirror,  through  a  lens,  onto  a  half-silvered  prism,  and  from  there  into  the  observer's  left 
eye.  The  prism  allows  the  observed  to  see  the  real  objects  in  the  surrounding  room  plus  the  virtual  images  or  synthetic 
objects  drawn  by  the  computer.  Stereoscopic  viewing  is  possible  by  sending  different  pictures  to  the  right  and  left 
eyes.  In  order  for  the  computer  to  make  a  synthetic  object  appear  stationary  as  the  observer  moves  abott,  the 
computer  must  monitor  the  position  and  orientation  of  the  observer's  head.  This  is  done  with  a  counterbalanced 
head-position  sensor  which  consists  of  a  2-m-long  telescoping  tube  attached  through  universal  joints  to  both  the  head 
set  and  pivotal  reference  point  on  the  ceiling.  The  cyclic  chain  of  events  from  the  reading  of  the  head-position  sensors 
to  the  drawing  of  the  vectors  on  the  CRTs,  takes  place  at  the  rate  of  at  least  20  times  a  second  in  order  to  avoid 
flicker  or  jerking  on  the  CRT.  Future  improvements  may  well  use  nonmechanical  sensors  to  establish  the  head  position 
of  the  viewer.  The  interaction  of  different  components  is  shown  below. 


The  Multiplex  Company  of  San  Francisco,  which  has  applied  for  a  patent  on  their  process.  You  may  wish  to 
address  your  inquiries  to  them  or  to  Don  Vickers,  LLNL  Graphics  Group,  L-73,  Lawrence  Livermore  National 
Laboratory,  P.O.  Box  808,  Livermore,  CA  945S0,  USA. 


Observer  wearing  the  heed  set  end  harness  of  the  head-mounted  display.  The  lower 
part  of  the  heed  position  sensor  shaft  and  the  encoders  for  the  lower  universal  joint 
are  visible  at  the  top  of  the  figure. 
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The  wand,  the  most  frequently  used  device  for  interaction  with  the  three-dimensional  synthetic  objects,  is 
equipped  with  four  buttons,  a  switch,  and  a  potentiometer.  These  signaling  devices  allow  the  observer  to  "tell"  the 
computer  whether  he  is  drawing,  erasing,  or  extracting  data.  The  computer  tracks  the  position  of  the  wand  by 
monitoring  the  length  of  line  attached  to  take-up  reels  on  the  ceiling.  These  three  spring-loaded  lines  also  serve  to 
counterbalance  the  weight  of  the  wand  and  the  cord  attached  to  it.  (Again,  wireless  position  sensors  can  readily  be 
envisioned  to  provide  mobility  to  the  user.) 

It  was  quickly  evident  that  four  buttons  were  insufficient  to  do  all  the  things  one  might  like  to  do  with  such  a 
wand,  so  a  wall  chart  divided  into  quadrants  was  designed  to  multiply  the  effect  of  the  buttons.  Each  quadrant  of  the 
chart  corresponds  to  a  different  mode  of  operation,  and  each  mode  redefines  the  meaning  of  the  four  buttons.  Of 
course,  only  an  observer  wearing  the  head  set  could  see  the  confirmation  signaled  by  the  computer  to  the  viewer  in  the 
form  of  a  virtual  cross.  Thus  the  wand  allowed  interaction  not  only  with  the  synthetic  objects,  but  also  with  any  other 
objects  in  the  room  which  were  "known"  to  the  computer. 

Though  the  head-mounted  display  was  built  at  Harvard  College,  it  was  taken  to  the  University  of  Utah  by 
Professor  Sutherland  just  a  week  or  two  after  its  completion.  At  the  University  of  Utah  it  served  as  a  research  tool 
for  several  graduate  students.  Currently,  however,  it  is  inoperable  and,  though  interest  persists,  funds  are  lacking. 
Clearly,  the  head-mounted  display  is  a  prototype.  It  is  not  as  yet  a  resource  available  to  present-day  cyrstallographers 
or  to  scientists  of  other  disciplines. 

Nonetheless,  the  head-mounted  display  has  been  used  with  modest  success  for  looking  at  three-dimensional 
mathematical  functions  such  as  a  3-4-5  Lissajous  Figure,  for  anlyzing  three-dimensional  electrocardiogram  data,  and 
for  simulating  arterial  structure  (one  was  able  to  walk  through  simulated  arteries).  It  has  been  used  to  "trace" 
three-dimensional  objects,  to  design  three-dimensional,  mathematically-defined  surfaces  and  to  simulate  what  a  blind 
person  would  "see"  if  an  electrode  array  were  implanted  in  his  visual  cortex.  To  our  knowledge,  it  never  has  been  used 
to  look  at  molecular  or  crystallographic  data. 

It  is  not  difficult  to  imagine  a  next-generation  head-mounted  display  system  which  would  show  colored  synthetic 
objects  as  solids  rather  than  as  outlined  drawings.  Such  a  display  might  be  equipped  with  remote  head-position  sensing 
and  with  telemetered  communication  both  from  the  wand  and  to  the  CRTs,  leaving  the  observer  completely  free  to 
move  around  with  no  wires  or  harnesses  to  worry  about.  Such  a  display  might  be  connected  to  a  computer  which  is 
processii^  x-ray  diffraction  data  and  could  be  used  to  look  online  at  a  crystal  structure  model  as  it  emerges  from  that 
data.  The  wand  could  be  used  to  single  out  certain  atoms  in  the  synthetic  image,  or  to  make  the  image  shrink  or  grow, 
allowing  the  observer  to  walk  around  it  or  even  inside  it. 

The  qualities  that  make  the  head-mounted  display  different  from  any  other  three-dimensional  graphical  I/O 
device  are  its  abilities:  (1)  to  produce  three-dimensional  images  which  are  constantly  updated  under  computer  control 
and  which  change  their  view  as  the  observer  walks  naturally  about,  just  as  real  objects  do,  and  (2)  to  superimpose  a 
three-dimensional  synthetic  object  on  real  three-dimensional  objects.  The  data  control  wand  adds  the  power:  (1)  to 
interact  with  these  three-dimensional  objects  using  natural  pointing  motions  rather  than  by  turning  knobs  or  pushing 
buttons,  and  (2)  to  interact  with  real  objects  In  the  surrounding  room.  And,  in  spite  of  being  limited  to  "wire-frame" 
drawings,  the  synthetic  objects  look  startlingly  real!  When  this  comes  about,  the  finding  of  facts  may  be  as  simple  as 
picking  ig>  a  synthetic  book  from  a  shelf,  checking  its  computer-based  index,  and  reading  its  virtual  pages  with  all  the 
quickness  of  the  human  mind,  and  projecting  their  data  into  space  for  a  personal  look  and  inspection.  Computer-aided 
fact  retrieval  will  then  have  become  an  extension  of  our  natural  environment.” 


View  of  computer  head-mounted,  three-dimensional  viewer,  hand-held  wand  for  data 
manipulation,  and  control  board  on  wall. 


Observer  pointing  with  the  wand  at  the  wall  chart.  The  head  mounted  display  and 
wand  r 'lowed  user  interaction  not  only  with  the  computer-drawn  objects  floating  in 
space,  but  also  with  real  objects  such  as  the  chart. 
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5.  THE  INTEGRATED  "TECHNOLOGY  INFORMATION  SYSTEM"  AT  LLNL 

Capabilities  of  the  Technology  Information  System  (TIS)  provide  nationwide  bibliographic  and  numeric  database 
management,  interactive  modeling,  electronic  communications,  and  distributed  networking.  These  capabilities  are 
self-guided  and  are  used  successfully  by  those  not  intimately  familiar  with  computers.  The  description  of  TIS  is  given 
here  as  an  example  of  an  operational,  intelligent  gateway  computer,  expected  to  serve  technologists  throughout  the 
1980s. 

TIS  is  a  new-generation,  dedicated  information  machine.  Programmatic  information  is  kept  on  TIS.  When 
additional  information  or  numeric-  data  are  needed,  TIS  connects  to  other  information  centers,  in  an  automated  and 
controlled  manner.  Authorized  users  simply  specify  the  target  name  of  the  desired  resource. 

In  addition,  since  much  of  the  daily  work  in  R&D  is  being  documented  on  electronic  word  processors,  we 
established  the  capability  of  linking  with  several  of  these  machines  for  transfer  of  information  and  data  to  and  from 
TIS.  Translation  of  formats  to  WANG,  LEXITROiT,  and  QYX  word  processors  is  carried  out  by  TIS  as  required.  Now 
that  commercial  hardware/software  have  started  to  appear  on  the  market,  we  are  planning  their  procurement  to  free 
our  resources  for  other  areas. 

Analysis,  synthesis,  and  post-processing  of  information  and  data  are  needed  to  speed  up  progress,  increase 
productivity,  and  transfer  technology.  TIS  gives  this  capability  to  each  user.  You,  the  user,  can  define  and  create 
your  own  data  files,  reports,  graphics,  and  communications  by  activating  self-guiding  routines.  Initially,  the  results  of 
your  work  belong  to  you  alone.  It  requires  a  permit  command  to  share  the  data  or  displays  with  someone  else,  a  group 
of  co-workers,  or  to  release  them  for  general  use. 

The  system  is  accessible  from  any  telephone  at  300  or  1200  bps,  over  the  ARPA  computer  network,  and  soon  also 
over  the  worldwide  TELENET/TYMNET  system.  FTS  and  WATS  lines  are  provided  for  cost-effective  use  of 
communications  and  convenience.  Here,  I  wish  to  highlight  the  major  capabilities  in  database  management,  modeling, 
and  communications.  I  hope  that  some  of  our  experience  may  be  useful  to  you  as  you  are  planning  your  integrated 
information  systems. 

The  Technology  Information  System  has  been  supported  by  the  DOE  Office  for  Energy  Systems  Research 
(DOE/ESR).  The  TIS  user  community  now  includes,  principally,  the  staff  of  the  Transportation  Systems  Research 
program  at  LLNL,  and  that  of  the  Seasonal  Thermal  Energy  Storage  program  at  Battelle,  PNL,  as  well  as  the  DOE/ESR 
staff.  There  are  about  170  authorized  users  throughout  the  country.  Electronic  communications  and  the  automated 
access  to  other  information  centers  is  available  to  all  users.  This  capability  is  used  extensively  by  the  Interagency 
Information  Exchange  committee  and  the  DOE-  Technical  Information  Center.  We  are  beginning  to  prototype  an 
Integrated  Information  Network. 

"Database  Management  Capabilities" 

Information  is  the  total  of  textual  and  numeric  data  displayed  in  a  meaningful  manner.  Most  systems  specialize 
in  one  or  the  other.  Also,  most  database  management  systems  require  a  computer  programmer  or  analyst  to  define  the 
schema  of  a  new  database  and  to  load  it.  Then,  when  this  is  done,  the  database  is  turned  over  to  the  user  for  retrieval 
and  updating.  When  special  features  are  required,  e.g.,  more  complex  reports  or  graphical  output,  the  services  of  a 
programmer  are  again  needed. 

The  Technology  Information  System  (TIS)  offers  the  traditional  database  management  procedures,  but,  in 
addition,  TIS  has  the  capability  of  direct  database  management  by  its  users  without  programmer  intervention.  This 
permits  the  use  of  TIS  as  an  extension  of  the  yellow  note  pad,  or  desk  calculator.  Thus  we  distinguish  on  TIS  two 
categories  of  databases,  public  and  private. 


The  Information  in  these  databases  is  displayed  in  a  hierarchical  manner  and  can  be  selected  with  simple 
specification  of  an  "Option  Number,"  not  unlike  what  you  find  in  some  of  the  popular  word  processors.  However,  a 
database  on  TIS  is  a  collection  of  programmatic  resources!  data  files,  models,  and  electronic  communications  are  all 
different  options  of  the  same  database  and  can  be  tailored  to  individual  programmatic  requirements.  Six  major 
databases  are  on  the  system  now.  We  name  especially  the  database  established  by  the  Transportation  Systems 
Research  (TSR)  program®'  for  DOE/ESR,  the  STBS  database,  and  that  which  contains  the  installation  and  technical 
specifications  of  thousands  of  expensive  pieces  of  optics  for  the  SHIVA/NOVA  laser  fusion  program  at  LLNL.68  An 
extract  from  the  display  of  the  TSR  database  follows! 
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The  public  databases  are  intended  for  general  use  in  support  of  a  particular  program.  Their  information  content 
can  be  viewed,  used,  and  extracted  as  required.  Temporary  changes  to  these  data  can  be  done  for  display  or  for  ad  hoc 
exploratory  calculations  by  any  user.  When  such  changes  are  made,  they  are  annunciated  in  the  input  record 
documenting  a  report  or  model  run.  Permanent  changes  can  only  be  initiated  by  the  originators  of  the  data  with  the 
help  of  the  Database  Administrator. 

In  the  private  database,  we  offer  the  additional  capabilities  of  database  creation.  The  create  command,  starts  a 
self-guided  routine  that  permits  you  to  establish  a  hierarchical  index  for  information  in  your  own  database  system. 
You  can  specify  and  name  the  data  files,  and  are  prompted  to  describe  each  data  field,  indicating  whether  it  will  be 
used  for  textual  data,  integer  data,  or  floating-point  data;  you  are  then  asked  to  name  the  units  of  measurement  and 
select  an  acronym  by  which  you  may  wish  to  refer  to  the  data  field  in  the  future  as  an  equally  valid  name  for  the 
corresponding  Option  Number. 

When  these  self-guided  definitions  are  completed,  data  can  be  entered  key-to-disk,  from  menu-driven  forms  that 
flash  on  the  cathode  ray  screen.  These  display  formats  can  be  activated  by  initiation  of  the  self-guided  makeform 
routines  and  can  be  called  into  action  as  needed  by  name.  In  several  eases  we  have  had  good  results  with  data  input  of 
this  type  cross-country  by  secretarial  help.  The  data  fields  are  explicitly  called  out  on  the  screen.  When  text 
characters  are  inadvertantly  entered  into  a  numeric  data  field,  the  terminal  keyboard  locks  and  signals  the  error  for 
immediate  correction  before  proceeding.  The  ipdate  command  is  used  to  append,  replace  and  replicate  data.  It 
provides  help  instructions  for  the  searching  of  erroneous  records  which  can  then  be  corrected  in  a  systematic  manner. 
Magnetic  tapes  are  used  for  input  when  larger  volumes  of  data  are  involved.  Data  can  also  be  transfered  over 
telephone  lines  or  over  the  ARPA  computer  network.  In  the  latter  case,  we  can  accept  data  at  an  effective 
transmission  rate  of  36,000  bps. 

"TIS"  GIVES  YOU  THE  CAPABILITY  TO  CREATE  YOUR  OWN 

-TIS-  OPERATIONAL  RESOURCES: _ U  INFORMATION  SYSTEM:  _ fc3 
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MmMb:  14  Etactric'Mhick  mod*h 

1  STES  Modal 

Communication*:  64  Communication  poets 

300-baud  taMphona  (mas  (Comm  *  FTS) 

12004aud  tatophon*  (mas  (Comm  *  FTS} 

300/1200-tNud  WATS  Unm 
ARPANET 
Talaphuna  dial -out 
Dadkatad  I  mat,  on  uta  at  LLL 
VOTRAX  Voka  symhaauar 
TYMNET/TELNET 

Othar  oomputan:  22  Automatad  linfct  to  otfwr  computort/canWPt: 

OOE/RECON.  NASA/RECON,  DOE  tftamatnu 
fuafe  (OK).  SERI.  OOT/TSC.  LSL4EEDIS. 

MIT  MACSVMA.  Wortdnam  INYT,  UP),  ate. 


The  display  and  extraction  of  information  or  numeric  data  can  be  carried  out  in  two  ways.  First,  each  public 
data  file  comes  equipped  with  one  or  several  preferred  display  formats,  also  referred  to  as  reports.  These  can  be 
activated  by  name  and  provide  the  option  to  specify  Boolean  logic  for  the  selection  of  those  records  that  satisfy 
numeric  and/or  textual  criteria.  Reports  can  be  graphs  or  tables.  You  can  choose  those  that  fit  your  terminal. 
Second,  you  can  create  your  own  reports  by  initiation  of  the  print  or  plot  commands.  These  routines  guide  you  to 
indicate  the  data  file  for  which  the  report  is  intended,  the  data  fields  to  be  printed,  summed,  labeled,  ordered, 
footnoted,  etc.  Plots  can  be  seen  in  black  and  white  on  Hewlett-Packard  2648  terminals,  or  in  color  on  HP  7221  X-Y 
color  plotters.  The  latter  can  be  used  to  make  viewgraphs  directly.  These  display  patterns  can  be  combined  with  text 
for  reports  which  are  activated  by  name  as  an  automated  sequence  of  commands.  A  device-independent  interface  is 
being  prepared  for  graphics  display  on  terminals  from  other  manufacturers.  In  the  example  below,  which  shows  a 
scatterplot  of  the  6200  Eutectic  salts,  created  on  TIS  rrom  the  magnetic  print  tape  to  the  corresponding  NBS 
publication,6*  we  discovered  7  salts  where  the  percentages  of  constituent  materials  exceeded  100%.  These  errors 
were  brought  to  the  attention  of  NBS/OSRD.  This  clearly  illustrates  the  power  of  database  management  for 
Information  Analysis  Centers. 

When  needed,  numeric  data  can  be  extracted  for  later  use.  They  are  prepared  through  the  print  command  and 
then  saved  in  separate  files  which  serve  as  direct  input  to  models  or  as  inclusions  in  electronic  mall. 
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T/f>«  •*  Optl**  NMUr,  •  CW*M,  or 

■aiwtn  WK't  i«all  «  ool2>  >  100. 


Us  Use  fait  Cutset  ic  N*i't  were  (*«n  •  w*iZ)  >  i00. 


worn*  m.r  cuuctic  dwtw 


KH.  X 


TJ (Cl  ■tfCHCHCC* 


209  BICU'NeC  >  -TiCIl 
ICC  mCO-POCD 
490  NICU-KCI 

c*ci2-N*c 


2IK  CeC'Z-uiCi 
4919  wef-T H»4 
«C»4  KT'NlF 


0.t-3?.*-*.9* 

IM1 
*  I  -95.9 
92.9-99 


I3-C4  » 00.00 
UK  ’00.00 
99. 9 'CO 


33l 

III! 


99 

1C) 

29C 


99 

<2 

i?9 


Tv#S  S"  Option  * 


Every  report,  graph,  and  sequence  of  presentations  can  be  used  initially  only  by  the  creator.  A  positive  permit 
command  is  required  to  share  the  information  with  someone  else,  a  grotp  of  co-workers,  or  the  user  community  as  a 
whole.  Both  in  the  public  and  private  databases,  the  user  is  shown  the  availability  of  only  those  data  files,  report 
formats,  and  display  patterns  to  which  he  has  access. 

For  those  who  like  to  venture  into  more  complex  work  with  data  files  and  reporting,  many  powerful  UNIX  utility 
routines  are  available.  All  routines  are  documented  online. 


Help  is  available  online  for  most  programs.  Commands  with  many  parameter  options  give  help  during  initiation 
by  the  user  prior  to  execution.  You  may  type  "help"  at  each  step  to  receive  guidance  for  the  next  question  to  be 
answered.  We  offer  also  online  tutorials.  This  is  decribed  later  when  the  link  command  is  discussed  under  TIS 
communications. 

"Modeling" 

The  execution  of  simulation  models  for  performance  prediction  of  energy  storage  systems,  or  for  technical  and 
economic  analysis,  can  be  carried  out  interactively  or  in  the  batch  mode,  in  three  ways  as  shown  below.  We  prefer  the 
first  mode,  which  permits  separation  of  data  files  from  the  models  with  inherent  advantages. 


MODELING 


Old  method: 

•  Batch  procewng 

•  Fawuiart 

•  Data  and  Program  intartwinad 


New  method: 

•  In  tar  active 

•  Manyuaan 

•  Data  and  Program  raparatad 


o  The  model  may  reside  on  TB,  which  controls  its  input  and  output. 

o  The  model  may  reside  in  ITS,  and  be  activated  by  TB.  but  prompting  is  carried  out  under  model  control. 
This  is  usually  the  case  for  imported  models. 

o  The  model  may  reside  on  another  computer  elsewhere  in  the  country,  but  is  controlled  by  TB  with  regard 
to  input  or  output 

The  first  method  is  well  represented  by  the  group  of  models  developed  at  LLNL  for  performance  prediction  of 
electric  and  hybrid  vehicles,  powered  on  various  energy  storage  devices.  Originally,  these  models  were  used  in  the 
batch  mode  and  the  code  was  intertwined  with  input  data.  Their  use  was  chfficult,  limited  by  computer  printouts,  and 
best  left  to  those  familiar  with  their  intricacies.  To  make  them  interactive,  and  to  prepare  them  for  use  by  others, 
the  modelers  prepared  succinct  statements  that  describe  the  purpose,  assumptions,  methodology,  and  limitations  of 
each 
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model.  These  descriptions  were  then  used  for  their  online  documentation  in  T1S.  Next,  the  modelers  wrote  the 
interactive  script  that  describes  each  parameter,  assumptions,  modeling  techniques,  and  limitations.  During  execution 
of  the  model,  the  user  is  prompted  to  select  categories  and  input  parameters  in  a  self-guided  manner.™*  <* 

Type  an  Option  Number,  a  Command,  or  “'stop".  Type  an  Option  Number,  a  Command,  or  "atop" 

*5  *5.3 


5  Interactive  Vehicle  Node  1  a 


5.1 

5.2 

Electric  Vehicle  Data 

Standard  Driving  Cycle* 

5.3 

5.4 

Electric  Vehicle  with  Optimized 
Battery-flywheel  Vehicle 

Batter  y 

5 . 5 

EXXON  Energy  Storage  Evaluation 

Mode  1 

5 . 6 

JPLEV  (Elect.  Veh .  Model) 

RESTF 

5.7 

5.8 

HI uml num/fll r  Battery  Vehicle  Model 

CRRfi  Electric  Vehicle  Model 

Type  an  Option  Number,  a  Command,  or  "stop", 

* 


5.3  Electric  Vehicle  mth  Optimized  Battery 


5.3.1 

Model  Purpose  and  Resumptions 

5.3.2 

The  Modeling  Technique 

5.3.3 

Baste  Vehicle  Types 

5.3.4 

Basic  Performance  Levels 

5.3.5 

Basic  Tims  Periods 

5.3.6 

Basic  Confidence  Levels 

5.3.7 

Battery  Descriptions 

5.3.8 

Or  i ving  Cyc les 

5.3.9 

Punning  the  Model 

Type  an  Option  Number,  a  Command,  or  "stop". 


The  descriptions  and  the  interactive  script  for  each  model  were  then  placed  in  a  small  data  file  and  were  made 
part  of  the  overall  Transportation  Systems  Research  database.  They  provide,  in  this  manner,  dynamic,  up-to-date 
documentation.  The  potential  user  can  familiarize  himself  with  any  aspect  of  the  model  background  by  selective 
reference  to  the  appropriate  Option  Numbers,  which  describe  the  origin,  purpose,  techniques,  limitations,  and  input 
data  files.  When  he  is  ready  to  run  the  model,  he  is  prompted  to  select  the  required  parameter  categories,  or  to  give 
his  own  values.  When  all  answers  have  been  received  by  TIS,  the  data  are  extracted  from  the  individual  data  files  and 
presented  for  viewing  and  confirmation.  Ad  hoc  changes  can  be  made  at  this  time.  They  affect  the  run,  but  not  the 
content  of  the  public  database.  Following  execution  in  real  time,  the  results  are  presented  in  tabular  form  or  as 
graphical  output.  We  have  also  devised  efficient  interactive  input  methods  which  the  user  may  wish  to  use  for 
repeated  execution  of  models. 


INPUT  TO  RODEL 


THI8  COMPUTER  RUN  WILL  START  WITH  A  BASELINE 
ICE  VEHICLE  DESCRIBED  AS  POUjOWB: 


OUTPUT  ntffll  HODCL 


the  •*«»•  4mtm,  tb*  following  pot-^otoro  ks*«  too*  «alesl«t*4 


2  paaseoger  car  (wt«2>  Halted  per f orwaaee  <pl*3> 

The  projected  «*e  I#  for  the  federal  erbaeoyole  (4o-3>  for  the 
1980-1990  tlx  period  <tp>2)  oa  a  OO*  coaftd.  predlotloa  Col-2) 

The  above  baaellao  vehicle  laplleo  the  following  data* 

(An  eater-lab  preceding-  a  line  Indicate*  that  data 
changed  by  the  liar.) 


. . FH%EB  ' 3 

pasaeager  car 

faf  ) 

Coef f lelent  of  4r*g 

fed) 

Cork  wslghl  of  v«kl«lr 

lew) 

Vehicle  length 

loll 

Power  ayolea  wo  low© 

(paw  ) 

power  ayilso  weight 

(paw) 

Pvwe r- to -wo <ght  ratio 

(par) 

Deal  rod  range 

(rd) 

Efficiency  of  wt r/cont rol 1  or 

Ulane  > 

Weight  propagation  factor 

(wpfl 

Rotor  weight  to  powor  ratio 

(wnpr  > 

Rotor  yolsnso  to  powor  ratio 

(vapr) 

Weight  of  payload 

(wp) 

»  <vt»2> 

'  18.000  Mft 

0.300 
427.000  kg 

♦.too  ri 

4.300  Cgft 
109.000  bg 
0.024 
120.440  k* 
0.850 


•  I .300 

•  4.300  kg/Uw 

■  0.040  c«ft/fcw 

«  134.000  bg 


lmrX  ft  fin  UltHfl 

rat eorve  lolereept 
■peel ||c  energy 


vehicle  ter. » 

pay lead 

total  teat  «t . 
vehlele  leagth 
bat  teey  w« . 

■©tor  ♦  root  rot  ter  at. 

I ran-oiaalea  at . 

road  load  energy  por  eycla 


«/bg 

w.fc'kg 


989.94 
13* .44 
1494.84 
I9.lt 
4*0  14 
122.06 
31  .09 
1401 .600 


thla  page  nf  aalpal  la  h©|4  la  (lla  wowed  event 
(■unld  yoo  like  to  rara*  thla  mo  del  at Ih  aero  ehoogee? 
<T»pe  la  •yea",  ar  "no*.) 


An  example  for  the  second  class  of  models  is  the  EXXON  econometric  model  for  electric  cars,  it  is  available 
on  TB,  but  its  prompting  is  that  originally  devised  by  EXXON.  Here  TIS  simply  acts  as  the  controller  for  the  model 
run  and  provides  a  convenient  means  of  initiation  and  execution.  Any  model  which  can  be  compiled  and  processed  on 
the  PDP-11/70  machine  can  be  integrated  into  a  TIS  database  in  this  manner. 

The  third  type  of  modeling  capability  on  TIS  is  equally  powerful.  Here  the  model  is  executed  on  a  foreign  host 
computer  under  TB  control.  TB  connects  an  authorized  user  to  the  distant  computer  automatically  and  activates  the 
named  model.  An  example  is  the  Electric  Vehicle  Model  (ELEVEC)  at  Jet  Propulsion  Laboratory.72  Another 
example  is  the  "CCC"  Thermal  Aquifer  Model73  under  development  at  Lawrence  Berkeley  Laboratory  (LBL)  for  the 
Seasonal  Thermal  Energy  Storage  Program.  "CCC"  was  moved  from  LBL  to  the  Solar  Energy  Research  Institute 
(SERI)  computer  to  become  part  of  their  solar  energy  model  library.  It  requires  a  CDC-7600  andoonsiderable  time  to 
execute.  In  this  case,  TIS  is  used  to  prepare  the  input  file  for  "CCC"  execution  at  SERI  and  the  retention  and  analysis 
of  results. 

This  technique  provides  a  very  powerful  capability  for  the  execution  of  models  at  any  site  under  TIS  control.  It 
also  offers  the  possibility  of  preparing  input  from  common  data  files,  executing  the  models  at  different  sites  under  TIS 
control,  and  comparison  of  calculated  results  on  TIS  to  establish  the  relative  accuracy  of  models.  (Candidates  for  this 
effective  procedure  are  the  national  energy  models,  which  can  not  readily  be  moved  from  their  home  base.)  With  this 
approach,  the  user  can  also  avail  himself  of  any  improved  version  of  the  model  when  it  becomes  available. 
Significantly,  the  models  do  not  have  to  be  translated  for  use  elsewhere  or  divorced  from  the  creative  work  of  their 
originators.  We  are  looking  forward  to  an  opportunity  of  using  TB  in  this  capacity. 

The  output  from  large  models  can  be  voluminous  and  difficult  to  handle.  As  a  remedy,  we  envision  the 
extraction  of  significant  calculated  parameters  from  the  resulting  large  output  file  for  viewing  and  decision  making  on 
TB.  The  bulk  of  the  output  file  could  be  left  at  the  remote  host  machine.  TB  would  maintain  the  index  and  the 
bookkeeping.  Alternatively,  as  being  planned  for  the  "CCC"  model,  the  output  file  could  be  transferred  to  the  LBL 
computers  under  TB  control  for  post-processing  on  LBL  machine*  and  printing  on  their  peripheral*.  The  effective 
transfer  of  such  large  volumes  of  data  requires  high-speed  communications  with  transmission  rates  of  at  least  9,600 
bps. 
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Modeling  requires  programming.  The  major  languages  available  on  TIS  today  are: 


FORTRAN  IV  APL 

PASCAL  "C" 


MACRO  0 
BASIC 
SNOBOL 
LISP 


RATFOR 

DC 

MB 

AS 


Several  powerful  text  editors  are  supported  by  the  UNIX  operating  system  and  provide  online  editing  capabilities  for  a 
variety  of  different  terminals.  Large  programs,  requiring  extensive  computer  time,  can  be  scheduled  by  TIS  for 
compilation  cr  execution  at  night.  The  results  of  calculations  can  be  saved  in  a  user-specified  library  of  data  files. 


MODELING  ON  "TIS'’.  COUTH: 


H 


*  Hwmur  studio,  oil -documenting 

*  Graphical  trand  amtym 

*  HictDrical  record  of  rasuta 


ICE  <i 


battary 

EV* 


Mi  Fa 

Ni-Zn 
Zn-Cfe 
LiMM2 
MaS  I  tar  I 
NrSf*M| 


All 

battary 

EV* 


Pb-ectd 

Mi-Fa 

Ni-Zn 

Zn-Ctv 

LiAiFeSZ 

Ma-S(oar) 

NaSlP^I 


Projected  initial  coat  (1977$)  of  anargy- 
storage  automobilaa  with  reduced  performance  deaigned 
to  maximise  the  uae  of  electricity. 


Several  statistical  and  graphical  analysis  routines  are  available  on  TIS.  We  have  established,  especially  in 
graphics,  a  number  of  powerful  programs,  some  of  which  permit  online  input  in  a  prompting  manner.  This  pertains  to 
the  creation  of  bar  cherts,  pie  charts,  and  milestone  charts  for  administrative  purposes.  The  graphs  can  be  prepared  in 
color  as  bard  copy  or  directly  as  viewgraphs.  Once  created  and  named,  the  resulting  format  file  can  be  released  for 
use  by  others  elsewhere  and  printed  near-instantaneously  cross-country  on  compatible  equipment. 

"Com  muni  cations" 


Effective  communications  are  essential  for  information  transfer  among  co-workers  in  different  time  zones.  TIS 
offers  the  following  cepabilities: 

comment  -  a  self -prompting  routine  to  send  public  messages  to  the  TIS 

database  administrator. 

write  -  a  diascript  between  two  users. 

link  -  provides  tutorials  for  one  or  a  grotg>  of  users. 

electronic  mail  -  serves  the  entire  user  community,  inclusive  of  voting,  and  the 

joint  preparation  of  reports. 

interconnection  with  word  processors  -  permits  the  transmission  of  letters  and  reports  via  TIS. 

Work  an  an  integrated  system  like  TIS  invites  comments  and  requests  for  improvement  to  the  TIS  administrator. 
The  self-guided  comment  routine  offers  this  capability.  These  suggestions  can  be  viewed  by  all  users  of  the  system 
and  offer  an  opportunity  to  see  the  requests  made  by  others,  and  the  corrective  response  by  TIS  management. 

The  write  command  is  a  diascript  between  two  users  logged  in  on  TIS  at  the  same  time.  The  write  command, 
followed  by  the  name  of  the  user  you  wish  to  reach,  prints  an  alert  message  on  the  remote  terminal.  A  similar 
confirmation  from  the  other  side  is  required  to  establish  communication.  Typing  than  takes  the  place  of  a  dialogue.  A 
signal  can  be  typed  to  indicate  the  end  of  a  question  or  statement,  invitiiv  the  response,  and  so  forth.  (If  you  should 
be  doing  serious  work  on  an  editor  or  in  graphics,  and  do  not  wish  to  be  interrupted,  you  can  issue  the  message  off 
command  to  silence  any  disturbance.) 


STUTOSIAL 


n 


ALUM. 


The  link  command  is  used  for  tutorial  purposes.  By  previous  agreement,  it  permits  any  two  users  to  work 
together.  One  user  becomes  the  teacher  and  works  in  the  student's  account.  A  dropfile  can  be  created  for  subsequent 
perusal  of  the  joint  transactions.  This  capability  is  being  used  by  TIS  staff  for  cross-country  tutorials.  They  are 
especially  effective  when  used  with  a  voice  phone,  permitting  the  student  to  see  and  hear  instructions  simultaneously. 
Arrangements  can  be  made  for  class  tutorials. 

Electronic  mail  (em)  permits  you  to  send  and  receive  messages,  to  answt-1  M  forward  mail,  to  issue  group  mailings, 
and  to  file  correspondence  in  a  mail  filing  system  of  your  own/4 

Some  26  different  options  are  available  to  compose  and  edit  messages  and  reports,  correct  spelling  by  reference 
to  the  online  Webster’s  dictionary,  send  blind  copies,  and  check  whether  an  addressee  may  have  already  read  your 
mail.  Of  course,  you  can  delete  all  mail. 

MAJOR  OPTIONS  OF  «m: _ M  READING  ANO  DISPOSING: _ M 


Online  help  is  available  for  all  options.  Commands  can  be  executed  by  their  imperative  or  starting  letter.  This 
creates  a  very  user-friendly  work  environment  for  beginners  and  experts. 

Interconnection  with  Text  Processors.  We  established  the  capability  to  connect  TIS  with  several  word 
processors:  WANG,  LEXITRON,  and  QYX.  Connections  to  the  FOUR  PHASE  system  and  VYDEC  are  planned.  When 
used  in  conjunction  with  electronic  mail,  any  letter  or  report  typed  on  a  word  processor  can  be  made  part  of  an 
electronic  mail  message  and  sent  on  its  way  to  the  destination  ahead  of  any  written  confirmations.  Incompatible 
control  characters  among  some  of  the  different  word  processor  systems  are  translated  by  TIS  as  required. 


"Distributed  Networking" 

This  is  a  very  powerful  TIS  capability.  It  permits  connection  and  use  of  other  information  centers  and  computers 
in  an  automated  and  controlled  manner.  At  the  present  time,  we  have  provisions  for  access  to  22  other  centers,  thus 
vastly  multiplying  the  information  content  and  capabilities  or  TIS.  The  Network  Access  Machine  software  (NAM)  used 
on  TIS  for  this  gateway  function  stems  from  earlier  work  by  the  NBS  Computer  Institute  for  Science  and 
Technology.7*’7® 

To  establish  a  connection,  the  arrangements  require  only  one  contract  with  TIS.  Individual  users  on  TIS  are  then 
granted  access  as  needed  for  the  duration  of  their  work.  Audit  files  keep  accurate  records  of  all  transactions  for 
accounting  purposes. 
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Ammptmf  Mtaphont  connection  «t  30(H»ud  to  DOE/ RECON 

at  ORNL.  Term 

Connection  mode 

Anwnptinf  tofin  to  DOE/RECON 

Lofin  compim 

DOE/RECON  n  ready.  PImm  enter  you r  raqum: 


SELECTIVE  SHARING  OP  DATA _ 


The  significant  aspect  of  this  capability  is  that  individual  users  of  TIS  need  not  learn  the  access  protocols, 
passwords,  or  peculiarities  of  the  foreign  host  computers.  They  simply  select  the  information  center  by  Option 
Number  or  by  target  name.  TIS  does  the  rest.  The  power  of  this  gateway  approach  was  recently  demonstrated  during 
the  7th  international  CODATA  conference  in  Kyoto,  Japan.  With  a  simple  command,  "connect  PARC,"  TIS  established 
a  computer  communication  from  Kyoto,  Japan,  via  TIS  at  LLNL,  to  the  DARC  system  at  the  Institute  of  Topology  at 
the  University  of  Paris,  France.  77  For  extended  periods  of  time  each  day,  interactive  graphics  of  the  sophisticated 
DARC  system  were  demonstrated  in  Japan,  in  full  duplex  at  1200  baud,  without  any  notifiable  delay  in  transmission. 

TIS  is  also  capable  of  retaining  the  viewed  or  extracted  information,  derived  from  a  foreign  host,  in  the  user's 
account  for  subsequent  processing  and  use  where  legally  permissable.  A  cogent  example  is  our  interconnection  to  the 
extensive  DOE/RECON  information  system.  All  citations  retrieved  can  be  placed  into  a  file,  aggregated,  and 
processed  interactively  online  for  the  immediate  creation  of  subject  and  author  indexes,  or  for  topical  concordances 
and  text  analysis  in  general.  Any  bibliographic  field  element  can  be  cross-correlated  with  any  other.  Where  required, 
citations  can  be  complemented  with  key-to-disk  annotations  about  their  relevancy  and  ranking.  Requests  for  full-text 
copies  can  be  issued  automatically.  Citations  can  be  augmented  with  numeric  or  descriptive  data  derived  from  the 
reports.  Publication  statistics  can  be  shown  graphically,  online,  and  enhance  further  the  insight  possible  from  the 
retrieved  information.  This  opens  new  vistas  for  extraction  of  higher  intelligence  from  descriptive  text  in  science 
and  technology. 
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TIS  -  Command! 

•  tramlatad  info  foraifn  boat  language 


a  for  ptttam  recognition  and  extraction 

We  expect  to  have  similar  links  soon  with  NASA/RECON  and  with  the  unclassified  Defense  Technical 
Information  Center  (DTIC).  These  files  are  in  the  public  domain  and  could  be  used  to  establish  comprehensive, 
well-indexed  bibliographies,  now  carried  out  more  laboriously.  This  capability  is  equally  applicable  to  numeric  data 
and  offers  the  opportunity  of  data  aggregation  from  different  sources  into  one  topical  summary.  Use  of  commercial 
systems  in  this  manner  requires  careful  study  of  legal  aspects  and  contractual  aggreements.  With  regard  to  the 
transfer  of  technology,  we  are  making  good  use  of  the  following  procedure  which  does  not  require  transfer  of  the  data 
or  models  to  TIS,  and  retains  full  control  in  the  hands  of  the  originators:  A  computer  account  is  opened  for  the  TIS 
user  community  on  the  remote  host  computer  where  the  resources  are  located.  The  owners  of  the  data  and  models 
release,  periodically,  those  versions  which  they  are  prepared  to  share  with  TIS. 


Tit  Unr  Community  Compunr 


NO  TRANSLATION/  ORIGINAL  DATA  AND  MOD€LS  ART  US€Of 
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CONCLUSION 

I  believe  we  can  look  forward  to  the  1960s  with  excitement.  In  the  past,  technology  often  lagged  behind  our 

information  requirements.  The  reverse  may  soon  be  true.  We  probably  find  it  difficult  to  imagine  ourselves  usii^  a 

pocket-portable  flat-screen  CRT  terminal,  homed  in  on  a  satellite  and  capable  of  searching  and  reading -- wherever 
we  are  -  -  any  of  the  S.2  million  retrospective  MARC  files  of  the  Library  of  Congress  now  being  installed  on  DIALOG, 
or  conducting  a  molecular  structure  search  on  the  4-million-large  Structure  and  Nomenclature  Search  System,  soon  to 
be  offered  by  CAS  and  CIS.  On  the  home  front,  the  new  9-digit  ZIP  code  of  the  U.S.  Postal  Service  should  be  capable, 
by  itself,  of  delivering  mail  to  anyone  in  the  United  States  directly  by  cross-correlation  with  computer-based  address 
lists  and/or  social  security  numbers.  Orwell's  1984  is  only  three  years  off. 

The  "Network  Nation"  is  more  than  the  title  of  a  recent  book.78  It  marks  the  beginning  of  a  new  decade  in 
which  intelligent  factual  information  may  become  the  scarce  and  costly  resource.  National  and  international 
organizations  are  trying  to  come  to  grips  with  this  situation.  CODATA's  emphasis  is  on  data  in  science  and 
technology.79  UNESCO  recognizes  basic  requirements  for  factual  data  in  developing  countries.8”  Regional 

information  centers  in  the  Far  East  and  Africa  are  being  planned  that  could  transfer  the  know-how  of  the 

post-industrial  countries  to  those  anxious  to  learn,  but  foresighted  enough  not  to  repeat  the  mistakes  of  the  past. 
There  are  also  signs  of  concern,  one  of  which  is  the  abuse  of  information.  We  are  learning  that  CIS  is  bringing  online 
for  worldwide  access  commercially  nonconfidential  data  and  production  volumes  of  chemical  manufacturing  plants  in 
the  United  States.91  We  are  also  aware  that  some  countries  find  it  expedient  and  in  their  interest  to  model  the 
economy  of  the  United  States  with  U.S.  demographic  and  time-series  data  on  U.S.  computers  . 

In  our  democracy,  free  access  to  information  is  essential.  We  should  not  be  remiss  in  using  it  ourselves,  first! 

May  God  grant  us  the  wisdom  to  know  how  to  proceed.92 
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Online  systems  and  networks  used  by  Norwegian  searchers 
are  outlined.  The  Nordic  information  network  SCANNET  is 
described.  User  problems  and  needs  are  presented,  in  the 
light  of  the  work  in  online  user  groups. 


Online  information  systems  are  young,  only  a  little  more  than  a  decade.  This  new  infor¬ 
mation  technique  has  developed  strongly  since  the  very  beginning,  and  user  requirements 
in  handling  such  systems  have  proved  to  be  valuable  factors  in  the  development  of  online 
services  as  well  as  networks.  As  users  of  online  databases  become  more  experienced  and 
better  aware  of  the  different  search  possibilities,  they  make  recommendations  which 
should  affect  the  further  development  of  the  online  services,  and  I  think  they  will  be 
influential  even  to  a  larger  measure  during  the  eighties,  in  forming  the  future  online 
information  services. 


DEFINITIONS 

First,  let  me  explain  who  are  the  users.  They  are  primarily  the  searchers  of  the  online 
databases,  in  other  words  the  intermediaries,  brokers,  information  officers,  those  who  are 
formulating  the  search  strategies,  selecting  the  systems  and  choosing  ’•'le  files,  using  the 
terminals  and  keeping  up  with  the  news  and  changes  of  the  services. 

But  there  are  also  the  end-users,  those  who  are  not  actually  doing  that  preparatory  work, 
but  are  the  real  users  of  the  information  in  their  work. 

Thirdly  there  are  the  non-users,  users  who  probably  are  not  aware  of  the  online  services 
and  are  potential  future  users. 

All  groups  have  their  particular  requirements,  and  it  is  obvious  that  the  system  and  net¬ 
work  designers  must  take  such  requirements  into  consideration  in  order  to  make  their  on¬ 
line  services  as  attractive  to  the  users  as  possible.  There  will  be  a  greater  struggle 
in  attracting  users  to  the  systems  as  competition  grows. 

The  online  systems  are  in  this  connection  the  systems  we  are  using  for  information  retrie¬ 
val,  with  bibliographic  and  referral  as  well  as  numeric  and  fact-oriented  databases. 


TODAY'S  SERVICES 


My  reflections  upon  the  user  needs  are  based  on  the  online  databases,  systems  and  communi¬ 
cation  networks  we  are  using  in  Norway  today,  especially  in  the  fields  of  science  and  tech¬ 
nology.  Let  me  start  with  an  overview  of  the  Norwegian  situation,  where  my  basis  is  the 
systems  and  networks  used  by  the  library  of  the  Norwegian  Institute  of  Technology.  Our 
library  documentation  services  are  offered  to  the  staff  and  students  of  our  university  as 
well  as  to  our  home  industry,  research  institutions  and  to  the  public,  in  keeping  with  our 
library's  assumed  duties  as  central  library  of  technology  for  our  entire  country. 


We  started  the  online  searching  in  1973  with  MEDLINE,  by  direct  long  distance  calls  to  the 
database  in  Sweden,  after  some  years  of  batch  SDI  experiences.  In  1975/76  we  started  as 
online  users  of  the  Lockheed,  SDC  and  ESA-IRS  systems.  In  1978  we  installed  a  1200  baud 
terminal,  and  that  higher  speed  is  now  used  in  the  online  searching  when  possible. 

The  online  information  services  and  networks  we  are  using  today  are: 


DIALOG 

SDC 

NLM 

ESA-IRS 

SCANNET 

!l 

TECHNOTEC 

BLAISE 


TYMNET 


Amsterdam 

It 


ESANET 
NORPAK 
UN I NETT 
CYBERNET 
Direct  call 


Copenhagen,  Stockholm 
Oslo 

University  of  Trondheim 
Oslo 


In  addition  we  are  using  some  other  online  databases,  e.g.  library  catalogs  ’S  our  own 
BIBSYS.  This  means  several  different  passwords  to  networks,  systems  and  d  ses. 

The  users  of  our  documentation  services  are  primarily  interested  in  bibliographic  data. 

We  are  concerned  about  the  growing  number  of  numeric  databanks,  but  the  library  is  well 
equipped  with  reference  literature  and  data  books  in  all  fields  of  interest,  so  there  are 
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not  many  questions  about  searching  online  numeric  databanks.  There  is  a  growing  demand 
for  market/business  information,  but  the  bibliographic  information  in  science  and  tech¬ 
nology  is  still  most  asked  for. 

From  Norway  we  get  connected  to  the  US  information  systems  by  calling  the  Amsterdam 
TYMNET  node.  Changes  in  that  procedure  will  be  made  in  the  near  future,  when  the  Norweg¬ 
ian  NORPAK  node  will  be  connected  to  US  information  networks. 

Norwegian  online  users  have  no  access  to  EURONET,  but  we  are  waiting  for  it,  for  instance 
through  the  Scandinavian  network  which  will  be  described. 

SCANNET  NETIVOPK  AND  DATABASES 

SCANNET  is  the  Scandinavian  contribution  to  networks  and  databases.  It  was  established 
by  NORDEORSK,  the  Nordic  Co-operative  Organization  for  Applied  Research,  in  1976  as  an 
experimental  network  to  make  machine-readable  databases  and  -banks  accessible  online 
within  the  Scandinavian  countries.  It  went  into  full-scale  operation  in  1976  (1).  Plans 
to  interface  SCANNET  with  EURONET  and  other  networks  were  made. 

In  1979,  when  SCANNET  was  reorganized  into  a  foundation,  the  Nordic  PTTs  decided  to  make 
SCANNET  a  ”semi"-publ i<~  network.  The  telephone  lines  were  no  longer  leased  by  SCANNET, 
but  the  searching  procedures  for  the  users  remained  the  same.  Today  the  Scandinavian  net¬ 
work  is  a  packet-switching  service  in  the  telephone  network  (21.  Since  the  PTTs  took 
over  the  responsibilities  for  the  communication  part,  we  no  longer  have  the  old  SCANNET. 
The  network  consists  of  2  nodes,  Databas  300/1200  or  TELEPAK  in  Stockholm,  and  NORPAK  in 
Oslo.  Multiplexors  in  Copenhagen  and  Helsinki  are  connected  to  TELEPAK,  which  is  also 
connected  to  information  networks  in  North  America.  The  previous  node  computers  in 
SCANNET  now  have  a  function  as  front-end  processors  for  the  various  host  computers  con¬ 
nected.  The  function  of  SCANNET  in  the  network  can  probably  be  compared  to  DIANE  in 
EURONET/DI ANE . 

A  universities'  network  in  Norway,  UNINETT,  has  been  developed  and  connected  to  NORPAK. 

It  can,  therefore,  be  used  for  searching  the  SCANNET  databases.  The  connection  from 
TELEPAK  to  the  networks  in  the  USA  is  not  open  to  Norwegian  users,  but  as  mentioned 
earlier,  there  are  plans  to  connect  NORPAK  to  networks  in  the  USA. 

The  number  of  SCANNET  databases  offered  is  constantly  increasing.  Databases  are  (spring 
1981): 

ALIS  -  Technical  literature 

BVR  -  Swedish  building  market  -  products  and  suppliers 

BYGCDOK  -  Community  planning,  architecture  etc. 

CHEM  -  Factual  bases  on  crystallography  and  mass  spectrometry 

CIS/ILO  -  Occupational  safety  and  health 

EPOS  -  Profile  handling  system  for  VIRA,  SDI-service 

EXTEMPLO  -  Electronic  journal  within  library  and  information  sciences  in  the  Nordic 

countries 

FINP  -  Finnish  economics  and  business  periodicals  index 

INIS  -  Nuclear  energy  &  technology 

IRRD  -  International  road  research  documentation 

MARKNADSBANK  -  Swedish  market  information  bank 

MECHEN  -  Mechanical  engineering 

MEDIC  -  Finnish  medical  literature 

MEDLARS  -  Medicine,  biomedicine.  Several  databases: 

MEDLINF. 

CANCERLINE 

CHEMLINE 

RTECS 

POLYDOC  -  Several  databases: 

AID  -  Art i kke 1  - Indeks  Database 

from  journals  in  technology  and  management 
AIpr:  -  AJOUR  and  product  guides  in  technical  fields 

F.DI  -  Nordic  literature  in  library,  documentation  and 

in  format  ion 

ETABLERING  -  Norwegian  industrial  establishments  abroad 
EKSPORT- INDEKS  -  Export  markets 
FoU-INDEKS  -  Norwegian  R&D  Index 
OL.JF.- INDEKS  -  Oil  Index 

SHIP  ABSTRACTS  -  Ship  technology,  ocean  engineering 
STANDARDS  -  Norwegian  standards 

RF.CODEX  -  Locating  index  for  interdisciplinary  scientific  reports  in  Scandinavia 

SCANT  -  Scandinavian  economic-  nd  business  periodicals  index 

SCIMP  -  European  periodicals  in  business,  management  etc. 

SF.RIX  -  Swedish  environmental  research  index 

VANY1T  -  Environmental  technology 

Some  test  databases  arc  also  in  the  system. 

As  you  can  see,  about  30  databases  are  available.  Some  of  them  are  well  known  internat¬ 
ional  ones,  while  others  arc  small,  specific,  Scandinavian  databases  and  -banks,  most  of 
them  searchable  in  a  Scandinavian  language  which  of  course  is  a  barrier  for  international 
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use.  Yet  you  may  find  information  in  them  which  is  not  available  in  other  databases,  and 

I  am  sure  the  interest  for  the  SCANNET  databases  will  be  growing  also  outside  Scandinavia. 

NORDINFO,  the  Nordic  Council  for  Scientific  Information  and  Research  Libraries,  has  now 
taken  over  the  financial  responsibilities  for  SCANNET  from  NORDFORSK  and  is  also  respon¬ 
sible  for  value  additions  and  the  building  up  of  databases  in  the  information  network  C3) • 

In  supporting  databases,  priority  has  been  given  to  those  which  have  a  Nordic  connection 

and  contain  material  that  is  not  available  via  the  large  international  information  sys¬ 
tems.  Many  of  the  existing  databases  in  SCANNET  have  been  supported  by  NORDINFO,  and 
further  development  of  bibliographic  and  factual  databases  will  surely  result  in  several 
new  SCANNET  databases. 

ONLINE  USER  GROUPS 

It  seemed  in  the  first  years  o'f  the  "online  age"  that  people  were  mainly  discussing  how 
computers  could  be  used  in  storage  and  dissemination  of  information.  The  user  needs  were 
seldom  elucidated.  But  the  many  information  systems  and  networks  soon  gave  the  online 
searchers  various  practical  problems  in  their  work.  During  the  last  years  several  studies 
on  user  needs  have  been  published. 

In  1976-78  online  user  groups  started  their  activities  to  focus  their  attention  on  the 
online  user  problems  and  needs,  and  to  extend  and  spread  the  knowledge  of  information 
systems  and  networks.  At  the  First  International  On-Line  Information  Meeting,  1977  in 
London,  representatives  of  online  user  groups  met  informally  to  form  a  European  group, 
which  is  now  a  working  group  of  the  European  Association  of  Information  Services,  the 
EUSIDIC  Working  Group  "National  User  Groups".  One  of  the  aims  is  to  represent  the  inter¬ 
ests  of  the  users,  and  the  group  has  discussed  many  problems  and  made  several  recommenda¬ 
tions. 

NORDINFO  took  the  first  steps  to  elucidate  the  Nordic  online  user  problems  and  needs  by 
bringing  together  representatives  of  the  Nordic  online  users  in  1978.  This  meeting  resul¬ 
ted  in  a  report  to  NORDINFO  on  the  situation  (4)  and  further  cooperation  between  Scand¬ 
inavian  user  groups,  who  have  been  meeting  for  the  last  few  years  to  discuss  problems  in 
common . 

In  the  light  of  the  activities  in  the  user  groups,  and  from  the  experience  I  have  as  on¬ 
line  user,  especially  in  the  fields  of  science  and  technology,  I  call  attention  to  the 
following  user  problems  and  requirements: 

-  Passwords,  logging  in  procedures,  telecommunication  problems 

-  Choice  of  systems  and  files  to  get  the  best  information  at  the  lowest  price 

-  Search  capability,  knowledge  of  database  structures,  indexing  practice 

-  Availability  of  databases,  systems  and  networks,  exclusive  rights 

-  Pricing  policy,  annual  fees  versus  prices  for  actual  use,  type  charges 

-  Standardization 

-  Back-up  services,  easy  access  to  the  primary  documents,  online  ordering 

-  Administration  of  services,  statistics,  updating  of  user  aids,  marketing 

-  Education  and  training,  experienced  and  new  users,  information  officers  and 
end-users  (and  non-users) 

-  Terminals 

The  passwords  are  often  complicated  even  for  an  experienced  user.  Although  the  searchers 
are  getting  used  to  the  different  ways  of  connecting  to  the  systems  and  databases,  the 
"logging  in"  procedures,  a  uniform  mode  of  operation  with  simple  passwords  is  urgently 
required,  especially  for  small  users.  Network  problems  cause  extra  costs.  It  seems  that 
there  are  more  often  communication  breakdowns  than  there  are  prob*.ms  with  the  host  com¬ 
puters,  systems  or  search  programs.  Reliable  networks  are  necessary  for  efficient  search¬ 
ing.  In  Norway  the  telecommunication  costs  are  more  than  the  computer  and  database  costs 
when  using  the  US  information  systems. 

It  is  important  to  have  access  to  various  systems,  to  be  able  to  choose.  When  one  system 
is  out  of  function,  another  may  be  used  if  a  search  is  urgent.  Different  systems  may 
offer  the  same  databases,  but  users  should  be  aware  that  there  are  sometimes  differences 
in  the  way  they  can  be  searched.  Search  commands  differ  from  one  system  to  another,  and 
various  features  are  made  searchable.  A  database  can  be  split  into  sub-files  in  one  sys¬ 
tem,  while  kept  complete  in  others.  Prices  can  vary  between  hosts,  and  computer  responses 
may  differ,  which  also  affects  the  search  costs. 

Search  capability,  knowledge  of  database  structures  and  indexing  practice  are  important 
factors  in  the  online  systems  to  get  the  best  information  at  the  lowest  price.  Experience 
from  printed  abstracts  journals  is  valuable,  and  here  the  libraries  have  an  advantage  in 
their  large  abstracts  journals  holdings. 

Access  to  some  databases  and  networks  is  currently  restricted  in  various  ways,  e.g.  geogra¬ 
phical.  European  users  have  no  access  to  some  of  the  DIALOG  databases,  Norwegian  users 
have  no  access  to  EURONET  etc.  The  database  exclusive  rights  probably  originates  from  the 
database  suppliers,  but  the  goal  must  be  equal  access  for  all  users  to  publically  available 
databases . 


The  time  differences  between  countries  and  continents  have  been  a  problem  for  database  acc¬ 
ess  during  users  working  days.  The  service  schedule  for  the  USA  information  systems  seems 
now  more  satisfactory  for  European  users,  hut  there  are  still  systems  where  the  time  sche- 
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dule  may  be  a  problem,  e.g.  in  SCANNET. 

During  the  last  few  years  there  have  been  changes  in  policy  and  tariffs  adopted  by  vendors 
and  database  suppliers.  Paying  for  the  actual  use  of  a  database  is  preferable.  Annual  or 
monthly  taxes  are  a  drawback  being  often  too  expensive  for  small  users.  When  paying  for 
the  actual  use,  it  is  easier  to  invoice  the  end-user. 

Type  charges  which  have  been  brought  on  during  the  last  few  years,  are  an  unjustified 
burden  on  the  interactive  search  process.  It  is  important  to  study  references  while  doing 
an  online  search  to  make  it  as  efficient  as  possible,  and  some  type  formats  without  char¬ 
ges  are  necessary.  Fortunately  such  formats  seem  to  be  common  now  in  information  systems. 

There  is  an  increasing  demand  for  standardization,  especially  at  the  user-system  interface, 
but  it  should  cover  databases  as  well.  It  is  difficult,  especially  for  small  users,  to  be 
effective  when  every  system  has  its  own  special  routines.  Research  on  a  common  command 
language  is  in  progress,  but  one  should  have  in  mind  that  database  structure  and  special¬ 
ities  always  have  to  be  important  factors  for  effective  searching. 

A  standardized  output  is  preferable.  The  variations  in  printed  references  cause  problems 
for  end-users.  References  should  be  easy  to  read  and  understand. 

It  would  be  desirable  to  have  a  standardized  invoicing  practice,  which  is  one  of  the  points 
the  EUSIDIC  user  group  has  been  concerned  with. 

When  operating  an  online  service  it  is  important  to  offer  a  satisfactory  back-up  litera¬ 
ture  service  to  the  users.  The  online  searching  is  only  a  part  of  the  information  retrie¬ 
val  process,  and  the  full  text  of  a  document  must  be  made  available  to  the  user. 

The  large  libraries  have  here  an  advantage  in  their  collections  and  their  national  and 
international  interlending  partners.  Studies  are  being  undertaken  concerning  UAP,  Univer¬ 
sal  Availability  of  Publications,  -  the  achievement  of  the  widest  possible  availability 
of  published  material  to  intending  users,  wherever  and  whenever  they  need  the  information. 

Online  ordering  is  a  rather  new  facility  introduced  by  several  online  vendors.  The  service 
is  often  more  expensive  than  ordering  from  a  library,  but  it  may  be  an  efficient  way  to 
obtain  the  full  text  documents  at  a  reasonable  length  of  time.  Document  suppliers  in  the 
online  ordering  facilities  may  be  database  producers,  libraries  or  information  brokers. 

It  seems  that  database  suppliers  and  libraries  here  will  be  giving  the  best  service,  but 
until  now  few  studies  of  the  service  have  been  published. 

Education  and  training  is  important  on  all  levels,  for  new  and  experienced  searchers. 
System- independent  education  is  desirable,  in  addition  to  the  training  courses  sponsored 
by  database  producers  and  online  vendors.  There  have  been  prepared  many  user  aids  and 
workbooks,  also  for  self-instruction,  which  I  find  very  useful,  because  going  to  a  lot  of 
different  training  courses  is  time  consuming  and  expensive.  I  suppose  that  education  and 
training  will  be  further  developed  during  the  years  to  come.  The  user  groups  are  very 
much  concerned  in  this  matter. 

Choosing  a  terminal  for  online  searching  is  largely  a  matter  of  personal  requirements. 

Many  variations  are  possible  -  display,  printer,  combination  of  both,  portable,  integrat¬ 
ed  acoustic  coupler,  switchable  speed  or  parity,  small  memory,  programmable,  etc. 

I  prefer  a  video  display  terminal  and  printer,  with  switchable  speed  or  parity,  and  addi¬ 
tional  monitors  for  demonstrations  and  teaching.  A  memory  is  useful,  and  new  aids  to 
users  are  constantly  being  developed,  for  instance  microprocessor-assisted  terminals  for 
more  efficient  searching. 

Much  has  been  done  during  the  last  few  years  to  make  online  services  attractive  to  the 
users.  Many  system  and  network  design  problems  have  been  solved.  Vendors  and  producers 
have  realized  that  information  to  users,  user  aids  and  guidances,  are  important,  and  that 
information  about  services  needs  frequent  updating.  The  users  receive  more  information 
today  than  some  years  ago.  The  administration  of  online  services  is  manifold,  and  users 
are  very  much  concerned  about  it.  A  better  understanding  of  the  user,  the  information 
officer  as  well  as  the  end-user,  the  psychology  of  man-computer  interaction  and  the  human 
factors  affecting  the  quality  of  this  interaction,  seems  to  be  important.  Some  investiga¬ 
tions  have  been  made  in  recent  years  (S). 

Information  should  be  equally  available  to  whomsoever  seeks  it.  The  value  of  information 
is  however  not  realized  before  it  is  actually  used  by  the  end-user,  the  research  scientist 
or  decision  maker.  Our  knowledge  is  still  highly  insufficient  in  this  matter.  I  want  to 
conclude  with  some  words  by  Y.S.  Touloukian,  which  I  find  important:  "While  the  scientific 
and  technical  community  is  being  smothered  by  an  overwhelming  document  birth  rate,  it  star¬ 
ves  from  a  lack  of  evaluated  information"  (6).  That  is  a  problem  also  for  the  online  user 
in  his  effort  to  find  the  best  information. 
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SUMMARY 

Optical  disc  technology  includes  the  storage  and  retrieval  of  random  access  graphics 
on  videodiscs  as  well  as  the  storage  of  digital  information.  Optical  digital  discs  for 
computer  mass  storage  are  currently  under  development  by  many  firms.  In  addition,  efforts 
are  underway  to  allow  the  encoding  of  digital  information  on  videodiscs.  The  latter  is 
desireable  as  an  inexpensive  publication  medium  for  machine  readable  data  as  well  as  a 
means  of  obtaining  both  video  and  digital  information  on  one  disc.  Potential  applications 
of  this  technology  include  inexpensive  online  storage,  random  access  graphics  to  comple¬ 
ment  online  information  systems,  hybrid  network  architectures,  office  automation  systems, 
and  archival  storage. 


II.  INTRODUCTION* 

The  optical  videodisc  and  its  counterpart,  the  optical  digital  disc,  both  hold  great 
promise  for  information  storage  and  retrieval  (IS&R).  The  videodisc  is  already  finding 
applications  in  the  delivery  of  graphics  to  complement  online  interactive  information  sys¬ 
tems  as  discussed  in  Section  IV. A.  In  addition,  the  application  of  optical  discs  to  digi¬ 
tal  storage  represents  one  of  the  most  promising  developments  in  computer  mass  storage 
technology  in  recent  years.  While  this  new  technology  is  undergoing  a  rather  long  gesta¬ 
tion  period,  its  potential  for  IS&R  should  be  realized  in  the  80 's. 

This  paper  shall  first  review  the  techniques  for  storing  digital  information  on  opti¬ 
cal  discs.  Greater  detail  may  be  found  in  [1].  Subsequent  sections  shall  address  the 
potential  impact  on  IS&R. 


III.  OPTICAL  VIDEODISC 


A.  Recording  Technology 

There  is  considerable  activity,  internationally,  in  the  development  of  various  video¬ 
disc  recording  technologies.  This  paper  is  limited  to  optical  disc  technologies  because 
of  their  special  interest  to  IS&R.  Even,  here,  there  exist  different  approaches.  For¬ 
tunately,  two  firms  that  have  pioneered  in  bringing  the  videodisc  technology  to  the  mark¬ 
etplace,  MCA  (now  DiscoVison  Associates  or  DVA)  and  Magnavox-Philips,  have  agreed  on  a 
common  disc  format.  This  format  contains  information  coded  in  the  form  of  microscopic 
"pits"  in  a  reflective  information  surface.  The  information  surface  is  covered  by  a  tran¬ 
sparent  plastic  layer.  Playback  is  performed  by  focusing  a  laser  light  beam  on  the  infor¬ 
mation  surface  and  monitoring  the  reflected  light  pulses.  No  light  is  reflected  from  the 
pits.  The  reflected  light  is,  in  reality,  a  frequency  modulated  signal  resulting  from  the 
differing  lengths  of  the  pits  and  the  "land"  between  them.  Thus,  the  encoding  is  not  just 
a  series  of  0's  and  l's  as  as  so  customarily  the  case  for  computer  storage.  There  may  be 
as  many  as  14  billion  (14,000,000,000)  pits  per  side  of  a  one  hour  (2  sides)  videodisc 
with  spacings  between  concentric  tracks  of  about  1  micron  (39  millionths  of  an  inch) . 
With  these  dimensions,  dust  particles  of  just  a  few  microns  in  diameter  at  the  information 
surface  could  obliterate  many  signal  elements.  The  protective  transparent  surface  serves 
to  keep  such  dust  away  from  the  information  surface  and  outside  of  the  focal  plane  of  the 
"optical  stylus’,  thus  minimizing  loss  of  information. 

There  do  exist  optical  recording  technologies  other  than  those  utilizing  reflecting 
surfaces  and  pits.  These  Include  both  photographic  and  photochromic  processes.  A  list  of 
such  processes  is  given  in  [2].  In  addition,  the  Thompson  CSF  Videodisc  System  employs  a 
transparent,  non-reflec’.ing  disc.  In  this  system  it  is  the  transmission  of  light  through 
the  disc  rather  than  light  reflection  that  provides  the  information  carrying  signal  [3], 

B.  Production  of  Videodiscs 

Optical  videodiscs  are  produced  by  a  mastering  and  replication  process  [4].  In  the 
DVA  process,  material  to  be  mastered  is  first  recorded  on,  or  transferred  to,  a  two  inch 
helical  video  tape.  A  glass  master  disc  is  created  by  first  exposing  the  prepared,  pho¬ 
tosensitive  surface  with  a  laser  beam  modulated  by  the  output  of  the  signals  recorded  on 
the  two  inch  helical  video  tape.  The  glass  master  is  then  photoetched.  From  the  glass 
master,  multiple  nickel  metal  "stampers”  are  produced  by  a  plating  process.  These 
stampers  are  then  utilized  in  an  injection  mold  process  to  stamp  one  side  of  the  disc. 

*  Portions  of  this  paper  originally  appeared  in  [1].  Permission  to  include  them  here 
for  completness  was  kindly  granted  by  Knowledge  Industry  Publications,  Inc. 
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After  the  addition  of  a  reflective  coating,  two  sides  are  joined  together  to  form  a  single 
two-sided  disc. 

The  mastering  and  replication  of  videodiscs  requires  controlled  clean-room  environ¬ 
ments  and  major  investments  in  sophisticated  equipment  and  facilities.  It  is  a  publica¬ 
tion  process.  (It  is  interesting  to  note  that  most  traditional  publication  processes 
utilize  some  form  of  "stamping",  be  it  print  on  paper,  or  music  on  records.) 

Production  capabilities  for  the  DVA/Philips  formatted  videodiscs  are  presently  avail¬ 
able  through  both  DVA  and  SONY.  Coses  for  creating  the  master  may  vary  between  $2,200  to 
$4,000  or  more  per  side  depending  on  the  amount  of  pre-editing  or  processing  required. 
For  small  quantities  DVA  is  quoting  replication  costs  of  $10  dollas  per  disc  (two  sides). 
Costs  for  large  quantities,  presently  determined  by  negotiation,  should  drop  sharply  with 
the  size  of  the  production  run.  3M  Corporation  is  to  establish  mastering  facilities  for 
the  production  of  Thompson  discs  in  the  U.  S. 

C.  Formats  and  Players 

Videodisc  players  and  disc  formats  may  be  differentiated  as  to  Consumer  or  Industrial 
(or  Institutional)  types.  The  consumer  and  industrial  players  differ  in  the  amount  of 
control,  intelligence,  signal  quality  and  robustness  offered.  Consumer  models  sell  for 
around  $800,  while  single  unit  costs  for  industrial  models  are  around  $3,000.  It  is  the 
latter  that  will  be  of  primary  interest  to  the  IS&R  community.  Consumer  model  players  are 
presently  being  marketed  by  Magnavox  and  Pioneer.  Industrial  players  are  being  marketed 
by  SONY,  DVA,  and  Thompson  CSF.  The  newly  announced  RCA  videodisc  players  employ  a  physi¬ 
cal  stylus  and  grooved  videodisc  records;  they  are  of  no  potential  interest  for  IS&R  since 
they  preclude  random  access  and  still  or  "freeze"  frame  capabilities. 

Industrially  formatted  discs  contain  concentric  tracks  with  one  video  frame  per 
track.  Each  side  of  a  30  cm.  (12  inch)  videodisc  contains  54,000  frames  in  the  USA  stan¬ 
dard  video  format,  or  47,000  frames  for  the  higher  resolution  European  video  formats. 
This  one  frame  per  track  format  is  that  which  allows  random  access  and  freeze  frame  capa¬ 
bilities,  both  of  critical  importance  for  IS&R  as  discussed  later.  At  30  frames  per 
second,  the  USA  formatted  discs  have  a  video  playback  time  of  one  half  hour  per  side. 

DVA  has  also  introduced  an  optical  disc  format  for  the  entertainment  market  that 
allows  one  hour  of  continuous  playback  per  side,  or  two  hours  per  disc.  This  format 
encodes  the  frames  in  a  continuous  spiral,  and,  like  the  RCA  videodisc  system,  is  of  no 
interest  for  IS&R  since  it,  too,  precludes  random  access  and  freeze  frame  operation. 


IV.  DIGITAL  STORAGE 

While  the  spur  to  the  development  of  the  optical  disc  technology  was  the  entertain¬ 
ment  market,  it  was  also  recognized  that  this  technology  could  be  exploited  to  store  digi¬ 
tal  information.  Two  approaches  have  been  identified:  one  is  to  encode  digital  informa¬ 
tion  onto  a  standard  video  signal  and  then  obtain  the  videodiscs  via  the  mastering  and 
replication  process  previously  described.  This  approach  will  be  discussed  in  Section  B. 
The  desirability  of  recording  digital  information  on  site  at  the  computer  was  also  recog¬ 
nized  and  a  methodology  for  accomplishing  this  was  "Initially  developed  by  Philips  Labora¬ 
tories,  North  American  Philips,  under  contract  to  the  USA  Department  of  Defense's  Advanced 
Research  Projects  Agency  (ARPA) .  This  second  approach  will  be  designated  "optical  digital 
disk*  technology  to  distinguish  it  from  the  digital  applications  of  the  "optical  video¬ 
disc". 

A.  Optical  Digital  Discs 

1.  Philips  DRAW  Process  The  Philips  optical  digital  disc  technology  15,6]  utilizes  a  spe¬ 
cially  prepared  disc.  Two  glass  or  plastic  discs  are  coated  on  the  inside  surfaces  with  a 
special  reflective  metal  film  (usually  tellurium),  separated  by  an  air  space,  and  hermeti¬ 
cally  sealed.  The  optical  digital  disc  is  mounted  in  a  recorder/player  connected  to  the 
computer.  A  relatively  high-powered  laser  beam  is  focused  through  the  protective  surface 
so  that  the  information  is  "burned"  into  the  inside  information  surface.  In  this  record¬ 
ing  process,  the  disc  carries  its  own  "clean  room"  with  it.  Another  advantage  is  the 
ability  to  read  the  data  right  after  it  is  recorded.  Philips  has  designated  this  the 
"Direct  Read  After  Write"  or  "DRAW"  (trademarked)  process.  Once  the  information  has  been 
burned  into  the  information  surface  it  cannot  be  erased  and  is,  consequently,  "read¬ 
only".  In  the  Philips  DRAW  information  system,  data  are  recorded  on  40,000  tracks  per 
side  of  a  12  inch  (30  cm.)  disc  for  a  total  capacity  of  10  billion  bits  (1  billion  charac¬ 
ters)  per  side.  Each  track  of  the  optical  disc  contains  32  addressable  sectors  of  15,200 
bits  each.  Research  is  underway  to  extend  the  data  storage  to  100  billion  bits  (10  bil¬ 
lion  characters)  per  side.  A  more  detailed  discussion  of  the  DRAW  process  may  be  found  in 
(6] . 

2.  DREXON  draw  Discs  The  disc  used  in  the  Philips  DRAW  process,  described  above,  is  sus- 
ceptible  to  deterioration  if  air  permeates  the  seals  between  the  two  surfaces  and  oxidizes 
the  metallic  film  containing  the  information.  A  new  process,  developed  by  Drexler  Tech¬ 
nology  Corp.,  (7-9],  utilizes  extremely  small  metal  spheres  and  filaments,  distributed 
throughout  a  polymer.  The  absorption  of  the  laser  light  by  the  metal  filaments  (silver 
halide)  causes  the  local  reflectivity  to  change.  These  micron-wide  areas  of  lower  reflec¬ 
tivity  correspond  to  the  "pits"  in  the  Philips-type  disc.  Since  the  metal  particles  are 
dispersed  within  the  disc,  however,  DREXON  discs  are  impervious  to  oxidation  [10].  As 
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another  advantage)  low  power  diode  lasers  can  be  used  to  both  write  and  read. 

In  the  future,  this  technology  may  also  be  applied  to  the  production  of  videodiscsr 
or  allow  the  inexpensive  replication  of  DRAM  discs  by  using  a  transmissive  DREXON  master. 
Copies  would  be  achieved  by  exposure  of  light  through  the  master.  Today,  in  small  quanti¬ 
ties,  DREXON  12  inch  diameter  discs  cost  $3,500  each. 

3.  Digital  Encoding  Since  optical  digital  discs  are  being  developed  solely  as  computer 
mass  storage  devices,  the  information  is  encoded  in  a  form  most  suitable  for  digital 
recording  and  requires  no  compatibility  with  video  signals  or  formats.  The  ability  to 
read  the  data  as  they  are  recorded  allows  the  immediate  identification  of  errors.  The 
errors  may  then  be  "corrected"  by  rewriting  the  data  in  a  new  sector  and  erasing  the 
address  of  the  bad  sector  from  the  computer  memory.  This  does  not  mean  that  the  informa¬ 
tion  can  be  erased  from  the  disk,  but,  rather,  the  knowledge  of  that  sector  can  be  erased 
from  computer  memory.  Hence,  the  user  information  stored  on  an  optical  digital  disc  can 
have  a  very  high  accuracy  (low  error  rate) . 

4.  Applications  The  optical  digital  disk,  therefore,  represents  a  very  high-density, 
low-cost,  archival  mass  storage  medium.  Future  videodisc  storage  units  may  employ  jukebox 
arrangements  for  the  online  access  of  as  many  as  1,000  discs[6].  Also  under  consideration 
are  optical  disc  packs  of  six  double  sided  discs  accessed  by  12  separate  read  heads[6]. 
Each  such  disc  pack  would  have  a  storage  capacity  of  1  trillion  bits  or  100  billion  char¬ 
acters  in  the  same  volume  as  required  today  by  a  magnetic  disc  pack  containing  300  million 
characters.  Optical  disc  packs  will  utilize  smaller,  solid-state  laser  Jiodestll]  instead 
of  the  bulkier  gas  discharge  lasers,  which  will  allow  the  dimensions  between  disks  to  be 
much  closer. 

5.  Availability  Although  announced  in  1979,  problems  have  delayed  the  introduction  of 
commercially  available  "draw"  units.  North  American  Philips  announced  the  availability  of 
prototype  DRAW  Recorder/Player  units  for  $150,000  and  player  units  alone  for  $20,000  [12], 
but  then  later  withdrew  the  offer.  Both  RCA  and  Philips  will  entertain  offers  to  deliver 
"DRAW"  units  but  cost  must  be  negotiated. 

It  is  generally  known  that  other  firms  are  developing  optical  digital  disk  mass 
storage  systems  similar  to  the  Philips  system  but,  as  of  this  writing,  no  other  units  have 
been  publicly  announced.  As  noted  by  Jerome  Drexler,  President  of  Drexler  Technologies, 
"...we  are  currently  supplying  media  to  18  commercial  firms... 16  are  major  companies. " [13] 
A  review  of  other  firms  engaged  in  the  optical  disk  technology  may  be  found  in  [13]. 

B.  DIGITAL  INFORMATION  ON  OPTICAL  VIDEODISCS 

The  potential  for  encoding  digital  information  on  the  standard  NTSC  video  signal  and 
format  was  addressed  by  George  Kenney  in  a  1976  paper  [14].  In  this  paper,  he  pointed  out 
the  economic  advantages  that  could  accrue  by  exploiting  the  already  existing  videodisc 
production  facilities  for  the  publication  of  machine-readable  information. 

There  are,  however,  many  problems  to  be  resolved  in  the  successful  exploitation  of 
the  videodisc  technology  for  the  storage  and  retrieval  of  digital  information.  Paramount 
among  these  is  the  identification  and  correction  of  errors.  Unlike  the  DRAW  process  dis¬ 
cussed  in  the  previous  section,  the  mastering  and  replication  process  for  videodisc  pro¬ 
duction  does  not  allow  dynamic  error  identification  and  correction.  Furthermore,  it  is 
known  that  there  are  many  sources  of  errors  in  the  production  process,  including  errors  in 
mastering  and  stamping,  and  errors  due  to  particulates  in  the  plastics.  Errors  presently 
occur  in  the  production  of  videodiscs  for  entertainment  purposes.  These  errors  (drop¬ 
outs),  while  often  visible  (drop-outs),  are  seldom  obtrusive  to  the  eye  or  ear.  The  sen¬ 
sitivity  of  digital  information  to  errors  is  much  greater,  since  the  loss  or  change  of  a 
single  bit  could  cause  a  computer  program  to  malfunction,  or  an  index  to  return  wrong 
information. 

Another  major  constraint  of  this  recording  mode  is  the  need  to  encode  the  information 
into  54,000  frames  of  basic  video  (NTSC)  format.  While  a  constraint  on  the  one  hand,  this 
is  also  a  major  advantage;  viz.,  the  ability  to  intermix  digital  and  video  information  on 
the  same  disc.  For,  if  the  output  of  the  disc  is  to  be  under  computer  control,  then  there 
must  be  an  index  to  all  significant  (start/stop)  frames.  This  index  will  also  contain 
information  on  the  digital  or  video  nature  of  each  frame;  i.e.,  video  frames,  when 
accessed,  will  be  directed  to  a  video  monitor;  the  output  from  digital  frames  will  be 
directed  to  the  computer  and  then  to  whatever  output  device  is  desired. 

1.  Lister  Hill  Center  Program  Although  Kenney  [14]  identified  the  potential  for  the 
storage  and  retrieval  of  "digital  Information  on  optical  videodiscs  in  1976,  there  has  been 
no  evidence  of  private  sector  developments  in  this  area.  If  realized,  however,  this 
application  of  the  technology  is  of  great  potential  importance  to  the  IStR  community.  It 
can  serve  as  a  publication  process  for  large  databases  and  full  text  document  collections, 
including  mixed  video  and  digital  information.  Towards  this  end,  the  Lister  Hill  National 
Center  for  Biomedical  Communications  of  the  National  Library  of  Medicine  has  initiated  an 
R6D  program  to  address  the  encoding  of  digital  information  on  optical  videodiscs  and  its 
playback  via  the  industrial-type  players.  The  theoretical,  error  free  storage  densities 
for  this  process  are  estimated  to  be  between  20  and  30  billion  bits  or  2  to  3  billion 
characters  per  side.  The  usable  data  per  side  will  depend  upon  the  types  and  magnitude  of 
the  errors  to  be  encountered  in  the  mastering  and  replication  of  the  digitally  encoded 
videodiscs  and  the  error  correction  methodologies  to  be  employed.  An  experimental  facil¬ 
ity  is  being  assembled  at  the  Lister  Hill  Center  to  allow  for  the  identification  of  such 
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errors.  Investigations  will  seek  to  determine  the  effect  of  current  videodisc  production 
standards  on  information  storage;  viz.,  to  what  extent  is  the  theoretical  storage  degraded 
by  errors  encountered  in  mastering  and  replication?  Subsequent  investigations  will  seek 
to  identify  where  improvements  in  the  production  process  will  have  the  greatest  impact  on 
usable  information  storage. 

2.  Videodisc  Interface  Unit  As  part  of  this  R&D  program,  the  Lister  Hill  Center  has 
developed  an  intelligent  videodisc  interface  unit  (VIU)  to  allow  computer  control  of 
industrial-type  videodisc  players  [15].  While  numerous  investigators  are  interfacing 
industrial-type  players  directly  to  microprocessors  or  computers,  the  Viu  offers  improved 
capabilities,  a  set  of  higher  level  commands  for  controlling  the  player,  and  a  degree  of 
device  independence.  The  latter  is  achieved  by  a  modular  design  such  that  different 
industrial-type  players  may  be  interfaced  without  changing  the  higher  level  control  com¬ 
mands  used  by  the  computer  program.  In  production,  the  VIU  should  cost  between  $600  and 
$700.  The  Viu  is  presently  programmed  to  interface  the  DVA  Model  7820  Industrial  Player 
with  either  a  terminal  or  host  computer. 


V.  IMPLICATIONS  FOR  IS&R 

The  optical  digital  disc  and  the  successful  application  of  the  optical  videodisc  to 
digital  storage  promises  to  impact  online  information  retrieval  services  in  a  variety  of 
ways,  some  complementary  and  some  competitively.  The  optical  digital  disc  will  not  only 
be  an  important  adjunct  to  large  centralized  services,  but  also  to  distributed  minicom¬ 
puter  based  systems  where  purchase  of  the  requisite  disc  drives  can  be  justified.  The 
videodisc,  with  its  potential  for  publication  of  machine  readable  data  bases  may  provide 
the  greatest  competition  for  centralized,  online  services.  Even  here,  though,  the  local 
provision  of  graphics,  audiovisuals,  and  full  text  may  be  complementary  rather  than  com¬ 
petitive.  The  most  conspicuous  impact  of  the  optical  digital  disk  will  be  in  terms  of 
online  storage  costs.  In  addition,  the  very  scale  of  online  storage  which  could  be  real¬ 
istically  made  available  will,  by  itself,  generate  new  offerings  and  services. 

A.  Random  Access  Graphics 

One  important  application  of  the  optical  videodisc  will  be  the  local  provision  of 
graphics  to  complement  online  information.  Online  information  delivery  has  traditionally 
suffered,  in  comparison  with  printed  publications,  by  the  absence  of  graphic  material.  The 
desirability  of  including  graphic  material  extends  from  simple  line  drawings,  such  as 
chemical  structures,  to  more  complex  line  drawings,  such  as  in  patent  submissions,  to 
half-tones,  color  images,  and  full  audio-visual  sequences.  The  latter  will  be  of  particu¬ 
lar  importance  as  a  complement  to  full-text  encyclopedic  data  bases  and  computer-based 
educational  material. 

Pergamon  International  publicly  demonstrated  in  March,  1981,  an  online  system.  Video 
PATSEARCH  (15),  for  searching  the  700,000  USA  patents  certified  since  1971.  Subscribers 
to  PATSEARCH  will  obtain  a  special  intelligent  terminal  with  attached  videodisc  player 
(presently  a  DVA  model  7820)  and  eight  videodiscs  containing  the  drawings  of  all  700,000 
patents.  The  USA  patent  database  is  accessed  by  a  user  through  an  online  vendor,  BRS, 
Inc.  After  logging  the  terminal  into  the  online  database,  and  mounting  the  appropriate 
videodisc,  the  user  may  call  up,  alternately,  the  patent  text  (citation)  or  the 
corresponding  patent  drawings  via  the  local  videodisc.  The  Pergamon  terminal  also  has 
sufficient  intelligence  to  allow  a  tailored  user-cordial  interface  [17]  which  minimizes 
the  knowledge  the  end-user  requires  to  effectively  negotiate  the  necessary  search  stra¬ 
tegies  and  access  the  corresponding  drawings.  The  Canadian  Patent  Office  also  plans  to 
install  and  test  VIDEO  PATSEARCH  [16]. 

VIDEO  PATSEARCH  is  the  first  commercially  marketed  application  of  videodisc-supported 
graphics  for  online  IS4R.  It  is  certain  to  be  followed  by  many  other  applications  such  as 
catalogs  of  satellite  imagery,  museum  collections,  medical  illustrations,  etc.  Each 
videodisc  will  have  a  capacity  for  approximately  100,000  images. 

B.  Online  Digital  Storage 

1.  Economics  Economic  savings  may  be  realized  by  a  reduction  in  the  media  cost,  in 
equipment,  and  in  physical  floor  space.  While  many  discussions  of  computer  memory  tech¬ 
nology  emphasize  the  cost  of  the  media,  the  cost  of  online  storage  is  primarily  a  function 
of  equipment  (e.g.,  disc  drives).  In  fact,  some  of  the  newer  magnetic  disc  storage  sys¬ 
tems  such  as  the  IBM  3370  and  the  CDC  33502  are  fixed  disc  systems  and,  hence,  the  medium 
is  integral  with  the  drive.  Even  the  floor  space,  herein  estimated  at  $25  per  square 
foot,  or  $600  per  disc  drive,  is  small  when  compared  to  the  cost  of  the  drives,  as  shown 
in  Table  I.  The  cost  for  the  IBM  and  CDC  disc  drives  were  obtained  from  USA  Government 
Schedules  and  are  to  be  used  only  for  comparison.  In  each  case,  the  cost  of  the  control 
unit  was  averaged  over  the  maximum  number  of  drives  that  could  be  attached.  The  costs  in 
Table  I  do  not  include  the  additional  equipment  in  the  central  processor  (e.g.,  extra  I/O 
channels  or  special  interface  units)  needed  to  support  additional  control  units. 

Table  I,  therefore,  provides  a  gross  picture  of  some  economies  that  could  be  expected 
to  accrue  from  the  present  and  near  future  storage  densities  of  the  optical  digital  disc. 
Online  storage  can  account  for  50  percent  or  more  of  the  total  equipment  costs  used  by  an 
online  service.  The  NLM  maintains  in  excess  of  10  billion  characters  of  online  storage. 
Even  so,  the  number  of  online  databases  maintained  by  NLM  is  far  fewer  than  those  main¬ 
tained  by  the  major  online  services  such  as  SDC  and  Lockheed.  As  shown  in  Table  I,  the 
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TABLE  I.  Cost  Comparisons  for  Online  Storage 


$K  per 

$K  per 

Mbytes 

$K/Drive 

10  Gbvtes* 

100  Gbytes 

1973 

IBM  3330-11 

200 

28 

1,430 

14,300 

1975 

IBM  3350 

317 

43 

1,300 

13,000 

1978 

CDC  33502 

635 

27 

430 

4,300 

1979 

IBM  3370 

571 

29 

480 

4,800 

1981 

IBM  3380 

2,500 

98 

392 

3,920 

1982** 

Philips  Disk 

2,000 

20** 

100 

1,000 

1983** 

Philips  Pack 

100,000 

50** 

51 

51 

*  gigabytes  or  billion  characters 
**  estimates 


most  dramatic  economies  to  be  foreseen  are  still  over  a  year  away  and  are  predicated  on 
the  availability  of  a  1  trillion  bit  (100  billion  character)  optical  disc  pack. 

2.  Access  to  Full  Text  One  qualitative  trend,  given  the  availability  of  low-cost  mass 
storage  systems  such  as  the  optical  discs,  will  be  the  greater  online  availability  of  full 
text.  Estimates  for  the  transition  of  scientific  journals  and  documents  to  full  elec¬ 
tronic  printing  (including  graphics)  are  10-15  years.  However,  the  technology  presently 
exists  for  storing  journal  copy  in  high  resolution,  compressed  facsimile  format  to  support 
online  browsing  or  computer -driven  demand  publication.  Another  major  program  effort 
within  the  Lister  Hill  Center,  "The  Electronic  Document  Storage  and  Retrieval  Program",  is 
directed  towards  the  intgeration  and  further  development  of  these  technologies.  The  high 
storage  density  and  random  access  characteristics  of  the  optical  disc  are  driving  technol¬ 
ogy  developments  in  this  direction. 

Where  the  full  text  is  available  in  machine  readable  form,  the  optical  disc  will  not 
only  provide  an  economical  storage  medium,  but  has  the  additional  potential  for  filling  a 
long  standing  gap  in  the  effective  utilization  of  associative  text  processing.  Previous 
efforts  in  associative  text  processing  have  concentrated  on  the  com*' iter  processors.  A 
major  limitation  has  been  the  lack  of  relatively  low-cost,  random  access  storage  units 
with  both  sufficient  capacity  and  data  rates.  Optical  digital  discs,  and  disc  packs,  com¬ 
bine  both  the  high  data  rates  and  large  storage  capacities  required  for  these  applica¬ 
tions.  Recent  developments  (18]  have  demonstrated  data  rates  of  30  Megabits  per  second 
with  capacities  in  excess  of  30  billion  bits  per  12  inch  disc,  and  data  systems  with  data 
rates  of  60  Megabits  are  shown  to  be  feasible. 

Just  how  associative  text  processing  will  best  be  employed  -  for  online  access  to 
text,  or  preprocessing  of  text  for  the  creation  of  indices,  or  both  -  is  still  unclear. 
It  is  only  certain  that  the  availability  of  storage  technologies  such  as  these  will  give 
impetus  to  their  reconsideration. 

Another  potential  application  will  be  the  local  or  regional  provision  of  full  text 
(with  graphics)  as  a  complement  to  online,  centralized  searching.  The  latter  implies  high 
speed,  sophisticated  processing  requirements  which  may  be  best  centralized,  while  the  pro¬ 
vision  of  audiovisual  segments  and  full  text  with  graphics  to  the  remote  user  implies 
major  communications  requirements  and  may  best  be  decentralized.  Since  communications 
are,  in  most  countries,  tariffed,  the  economic  trade-offs  between  searching  and  informa¬ 
tion  delivery  may  not  be  dictated  by  technology  alone. 

C.  Distributed  Vs.  Centralized  Information  Delivery 

The  potential  for  distribution  of  machine  readable  information  via  optical  videodiscs 
or  optical  digital  discs  may  appear  as  a  threat  to  centralized,  online  services.  Given 
the  alternatives,  decisions  will  be  based  on  considerations  similar  to  those  dictating  the 
choice  of  in-house  vs.  time-shared  computing  services.  These  considerations  are  primarily 
economic  and  include  the  cost  of  computer  equipment,  communications,  personnel,  and  the 
frequency  of  need  for  the  information  at  the  going  market  rate. 

For  the  distributed  delivery  of  full  text  information,  the  difference  between  the 
optical  digital  disc  and  the  proposed  digital  application  of  the  optical  videodisc 
includes  both  publication  and  equipment  cost.  The  optical  digital  disc  can,  at  present, 
be  replicated  only  by  the  sequential  copying  from  one  disc  (or  pack)  to  another.  This 
should  pose  no  major  impediment  to  regional  distribution  of  information,  but  would  be  very 
costly  in  quantities  of  100  or  more  copies.  Mass  publication  in  such  quantities  (or 
larger)  would  be  in  the  domain  of  the  optical  videodisc  technology.  Similarly,  for  single 
users,  exploitation  of  the  commercially  available  industrial  players  may  greatly  reduce 
equipment  costs.  Such  players,  on  the  other  hand,  will  not  effectively  support  multiple 
users. 


1.  Microform  Surrogate  As  soon  as  the  storage  of  journal  and  document  pages  is  con¬ 
sidered,  we  are,  by  implication,  addressing  a  potential  surrogate  for  microforms.  While 
the  optical  discs  will  have  better  random  access  capabilities,  will  they  compare  economi¬ 
cally  with  microforms?  In  order  to  obtain  such  a  comparison  it  will  be  necessary  to  make 
some  assumptions  regarding  the  amount  of  digital  storage  required  per  journal  page. 
Although  normal  facsimile  reproduction  employs  a  scanning  resolution  up  to  200  lines  per 
inch  (40,000  bits  per  sq.  in.),  this  is  insufficient  for  high  fidelity  archival  storage  of 
journal  pages.  It  is  estimated  that  for  good  reproduction  of  text  (not  half-tones  or  color 
graphics),  approximately  500  lines  per  inch  (250,000  bits  per  sq.in.)  will  be  required  to 
accurately  reproduce,  with  excellent  quality,  the  smallest  characters  found  in  scientific 
journals.  Estimates  for  digital  compression  for  these  resolutions  range  from  20:1  to 
100:1.  Hence,  assuming  a  scanning  resolution  of  500  lines  per  inch,  a  50:1  compression 
ratio,  a  page  size  of  8"  x  10",  and  a  disk  storage  of  20  billion  bits,  will  result  in 
50,000  stored  pages  per  disk.  Depending  on  quantity,  the  videodiscs  will  cost  between  $2 
and  $10.  Hence,  cost  will  vary  between  $.04  and  $.20  per  1,000  pages.  By  comparison,  a 
standard  24x  microfiche  contains  approximately  100  frames  per  transparency,  while  the  NCR 
ultrafiche,  as  used  by  the  British  Library  for  their  "Books  in  English”  publication  (20], 
contains  2,380  frames  per  transparency.  Assuming  a  cost  of  fiche  duplication  at  $.25  per 
transparency  for  standard  fiche  and  $.50  per  transparency  for  ultrafiche,  this  results  in 
costs  of  from  $.20  to  $2.50  per  1,000  pages.  Hence,  in  terms  of  the  media,  the  videodisc 
may  be  an  economic  competitor  to  microform  for  large  collections. 

It  must  be  noted  that  the  difference  in  equipment  costs  will  preclude  competition 
between  microforms  and  optical  discs  for  the  individual  user  for  the  foreseeable  future.  A 
microfiche  reader  can  be  very  inexpensive.  Access  to  information  stored  on  optical  disc 
will,  on  the  other  hand,  require  optical  disk  players,  high  resolution  displays  and/or 
printers,  and  some  form  of  computer  (or  microprocessor)  capability.  It  can  be  expected, 
nonetheless,  that  videodiscs  will  compete  with  microform  in  (a)  regional  centers  that  sub¬ 
scribe  to  major  microform  document  collections  such  as  ERIC  (Educational  Research  Informa¬ 
tion  Center)  and  provide  for  local  document  delivery  by  nicroform  duplication  and  hardcopy 
blow-back,  and  in  (b)  libraries  and  information  centers  that  have  cost-justified  the 
acquisition  of  the  appropriate  equipment.  It  will  also  be  possible  to  generate  microforms 
by  COM  (computer  output  microform)  directly  from  the  optical  disk  storage. 

2.  Document  Delivery  In  a  previous  section,  the  optical  digital  disc  was  discussed  as  a 
potential  document  storage  medium.  There  exists  a  natural  extension  to  thiB  in  terms  of 
document  delivery.  Given  the  facsimile  storage  of  the  documents  and  journals  as 
described,  there  exists  today  computer-driven  xerographic  laser  printers  by  IBM,  Siemens, 
and  Xerox  that  could  print  this  information  at  rates  of  up  to  2  pages  per  second  (ca.  14 
million  pages  annually  per  machine  per  single  shift  of  40  hours  per  week) .  Of  these 
machines,  the  Xerox  9700  laser  printer  has,  at  present,  the  highest  resolution  of  300 
lines  per  inch.  A  version  of  this  machine,  modified  to  handle  card  stock,  is  employed  by 
the  Library  of  Congress  to  publish  catalog  cards  from  the  MARC  database.  While  these 
machines  are  expensive  (ca.  $350,000),  high  volume  output  brings  the  cost  per  page  to  1 
cent  or  less.  It  is  safe  to  say  that,  in  the  future,  a  spectrum  of  lower  cost,  albeit 
slower,  printers  will  be  available  for  distributed  printing. 

Towards  this  end,  whether  the  documents  are  printed  centrally,  or  the  information  is 
transmitted  over  document  delivery  networks  to  local  printers,  or  distributed  via  video¬ 
discs  for  local  access  and  printing,  will  be  a  matter  for  future  determination.  The 
deciding  factors  may  be  political/economic  rather  than  technical;  e.g.,  the  establishment 
of  communications  tariffs. 

Note,  however,  if  centralized  distribution  is  utilized,  then  videodisc  publication  is 
not  required  and  the  higher  storage  densities  of  the  optical  digital  disc  could  be 
employed.  Extending  our  previous  calculation  for  the  page  storage  to  the  higher  densi¬ 
ties,  and  assuming  1,000  pages  per  journal  per  year,  the  figures  in  Table  II  are  obtained. 


TABLE  II.  Estimated  Page  Storage  Densities 


Billion 

Journal 

Characters 

Page  8 

Volumes 

1980 

Disk 

2 

50,000 

50 

1982* 

Disk 

20 

500,000 

500 

1983* 

Disc  Pack 

100 

2,500,000 

2,500 

*Estimated 


3.  Hybrid  Network  Architecture  In  order  to  clarify  the  concept  of  a  hybrid  network 
architecture,  as  intended  here,  the  example  of  library  shared  cataloging  will  be  explored: 

It  is  not  reasonable  to  take  pictures  of  library  catalog  cards  for  storage  as  video 
frames  on  a  videodisc  because  of  the  lack  of  resolution  on  standard  video  monitors.  If 
one  assumes,  however,  that  a  catalog  entry  has,  on  the  average,  500  or  less  characters, 
then  a  video  or  optical  disk,  with  a  storage  capacity  of  2  billion  characters  could  store 
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the  contents  of  4  million  catalog  entries. 

While  there  are  very  few  individual  libraries  with  4  million  catalog  entries,  all 
libraries  may  have  the  need  to  share  access  to  machine  readable,  regional  and  national 
union  catalogs.  This  potential  of  inexpensively  duplicating  such  large  collections  of 
catalog  information  may  result  in  new  approaches  to  shared  cataloging.  For  example,  the 
consideration  of  network  strategies  in  such  areas  as  shared  cataloging  may  be  seen  to 
result  from  two  conflicting  premises: 

1.  To  maintain  currency,  the  catalog  should  be  maintained  and  updated  centrally. 

2.  It  is  impractical  and  undesirable  to  service  a  nation  of  online  users  from  one, 
centralized  computer  center. 

Regarding  the  first  premise,  it  is  obviously  desirable  that  a  librarian  wishing  to  catalog 
a  new  acquisition  have  access  to  the  efforts  of  another  librarian  who  has  cataloged  that 
item  even  minutes  or  seconds  before.  Regional  network  strategies  have  been  recommended  to 
resolve  the  conflict  between  these  two  premises.  In  such  strategies,  the  database  would 
be  replicated  at  each  regional  center  to  service  online  users  in  each  region.  The 
regional  centers  would  maintain  currency  among  themselves. 

The  potential  of  utilizing  the  optical  videodisc  as  an  inexpensive  publications 
medium  could,  in  the  future,  provide  an  alternative  network  strategy.  This  alternative 
strategy  may  incorporate  three  hierarchical  levels:  the  central  update  facility,  a  local 
(or  regional)  storage  of  "recent"  additions,  and  a  cumulative,  historical  collection  on 
videodisc.  In  such  a  strategy,  the  optical  or  videodisc-based  historical  collection  will 
be  published  periodically  (monthly,  bimonthly,  quarterly,  etc.).  The  "recent"  storage 
will  be  standard  read/write  storage  and  will  be  updated  via  the  network,  weekly,  nightly, 
etc.  from  the  central  facility.  A  catalog  query  to  the  site  serving  the  user  will  first 
search  the  historical  and  "recent"  catalogs.  If  no  local  information  is  found,  then  the 
request  will  be  automatically  routed  to  the  central  facility.  The  frequency  of  videodisc 
publication  and  "recent"  catalog  updates  will  be  chosen  to  maintain  a  reasonable  rate  of 
transactions  at  the  central  facility. 

Hence,  hybrid  network  architectures  may  include  access  to  distributed  historical  and 
recent  data,  while,  at  the  same  time,  provide  centralized  access  to  the  most  current  data. 

D.  Office  and  Local  Archive  Systems 

1.  Systems  The  DRAW  system,  pioneered  by  North  American  Philips  and  discussed  in  Section 
III. A  utilizes  relatively  powerful  gas  lasers  and  sophisticated  tracking  or  indexing 
mechanisms  to  insure  precise  positioning  of  each  track.  Two  other  systems  are  under 
development  that  utilize  "pre-grooved*  optical  discs.  These  systems  allow  less  expensive 
and  more  robust  tracking  mechanisms  and  are  designed  for  local  office  environments.  One 
system  has  been  under  development  by  Philips-Eindhoven  [19]  for  a  number  of  years.  The 
other,  by  Toshiba  (Japan)  [13],  was  demonstrated  at  the  1981  National  Computer  Conference 
and  should  reach  the  market  within  two  years. 

2.  Uses  Both  systems  are  intended  to  support  the  archival  storage  of  local  correspon¬ 
dence  and  other  office  documentation.  The  Toshiba  unit  contains  integrated  facsimile  and 
laser  printing  unics.  The  former  to  scan  document  images  which  are  then  stored  on  disc, 
and  the  latter  to  print  documents  on  demand  from  a  disc.  The  resolution  presently 
employed  is  approximately  200  lines  per  inch  (7  lines  per  mm.).  While  sufficient  for 
typewritten  material,  it  is  not  sufficient  resolution  for  archival  quality  journal  page 
images,  hence  the  emphasis  on  office  systems.  Such  systems  should,  however,  provide  good 
archival  storage  and  source  data  capture  for  many  of  the  applications  discussed  in  the 
Artemis  report  [20]. 

E.  Preservation 

A  major  advantage  of  optical  discs  is  their  potential  archival  quality.  Accelerated 
aging  in  the  laboratory  has  been  inconclusive  in  determining  actual  lifetimes.  At 
present,  lifetime  appears,  for  all  practical  purposes  to  be  unlimited  for  videodiscs. 
Oxidation  of  the  substrate  in  Philips-type  DRAW  discs  was  discussed  in  Section  IV. A. 2,  as 
well  as  new  developments  that  may  eliminate  this  potential  problem  such  as  the  DREXON  draw 
discs. 


Even  if  the  discs  do  not  provide  unlimited  archival  lifetime,  the  manner  in  which  the 
data  is  stored  (digitally)  will  allow  periodic  regeneration  of  the  discs  without  the 
attendant  loss  in  quality  now  associated  with  different  generations  of  images  on  photo¬ 
graphic  film.  This  "regeneration  without  loss  of  quality”  will  be  an  expected  occurrence 
as  improved  materials  are  found. 


VI.  Conclusions 

Optical  disc  technology  portends  many  dramatic  improvements  for  ISsR  in  the  Eighties. 
It  should  also  be  clear  from  the  preceding  discussions  that  the  promise  of  the  optical 
disc  technology  for  digital  storage  is  the  driving  force  for  many  complementary  technolo¬ 
gies  such  as  high  resolution  digital  scanning  and  compute r -dr iven  laser  printing. 


L 
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The  optical  disc,  itself,  is  an  almost  ideal  medium  for  IS&R.  If  it  were  necessary 
to  choose  between  read/write  or  read/only-archival  storage,  all  other  aspects  being  equal, 
the  latter  may  be  far  more  important.  User  access  to  online  information  is,  for  the 
overwhelming  majority  of  cases,  read-only  anyway;  i.e.,  the  user  is  neither  allowed  nor 
does  he  have  a  need  to  change  the  source  of  the  information. 
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USERS  REQUIREMENTS  TOR  TERMINALS;  SELECTION,  USB  AND  ERGONOMIC  FACTS 

H.J.LEEHEEK, 

Institute  for  Perception  TNO 
Soeatarberg, 

THE  NETHERLANDS. 

INTRODUCTION 

In  aiany  procaaa- control  situation*  a  computer  ta  used  to  aaalat  the  men  operating  the  ayatem.  The  varloue 
taeke  can  then  be  allocated  between  man  and  machine:  the  machine,  or  the  computer,  to  undertake  the  dull, 
repetltloue  or  fatiguing  tanka;  the  human  operator  to  undertake  the  aupervleory  and  declalon-maklng  taeke 

The  elemente  of  the  taeke  thue  designed  and  asalgtied  to  the  human  operator  are  the  perception  of  the 
machine  output  elgnale  and  the  Initiation  of  a  desired  state  through  control  actions.  Such  man-machine 
Interfaces  can  be  deecrlbed  by  the  cycle  given  In  Fig.  1. 

DECISION 

processing 


EXECUTION 

Fig.  1  The  ergonomic  man  -  machine  cyole 

The  machine  presents  Information  to  the  operator,  who  must  perceive  It;  then  process  the  Information  and 
make  a  decision.  The  operator  must  Implement  the  decision  by  control  actions  using  hla  limbs.  The 
machine  must  then  respond  to  the  control  action  and  the  new  state  shell  be  Indicated  by  signals  which 
again  muat  be  perceived  by  the  operator . etc. 

This  chain  ts  as  strong  as  lte  weakeat  link.  No  part  In  the  cyole  can  be  neglected.  Nevertheless  man 
machine  cooperation  often  falls  by  bad  adaptation  of  the  machine  to  human  capabilities  and  limit¬ 
ations.  This  occurs  In  the  man  machine  interface;  the  terminal, 

Ergononomlats  study  these  odatatlon  aapecta  and  they  can  give  a  great  many  recommendations  and  rules 
of  thumb  for  an  optimal  cooperation  between  men  and  maehlne. 
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In  Pig.  1  *  horizontal  zona  in  tha  nan-nachlna  cycle  Is  drawn.  It  represents  the  function  of  a  terminal 
In  this  cycle.  It  is  clear  that  terminals  have  output  and  input  devices.  Some  of  these  possibilities 
ore  shown  in  Fig.  2. 


Fig.  2  Typical  output  and  input  devices  of  a  terminal 

All  input  nnd  output  device,  ought  to  be  adapted  to  the  operator  of  the  terminal.  It  would  be  a 
shortcoming  of  the  system  if  the  operator  had  to  be  adapted  to  the  terminal. 

In  thla  paper  typical  human  aspects  of  terminals  will  be  discussed. 

In  many  cases  conventional  processing  and  machine  handling  1.  replaced  by  computer  handling.  Thla  paper 
will  discuss  the  typical  new  computer  terminals.  At  first  a  description  of  the  present  state  of 
engineering  will  be  given,  also  an  expectation  of  future  equipment  will  be  briefly  mentioned. 

There  Is  a  tendency  to  use  more  keyboard  input  devices  Instead  of  a  great  diversity  of  buttons,  switches, 
wheels,  etc. 

So  It  Is  unavoidable  to  study  workstations  with  visual  display  units  connected  with  keyboards. 

Very  often  people  think  that  these  device#  are  so  new  end  so  stypicsl  that  recommendations  for  convent- 
onel  equipment  fall  to  cover  the  problems. 

However  moat  conventional  recommendations  and  rules  of  thumb  can  be  directly  applied  to  modern  devices 
like  VDU's. 

KINDS  OF  TERMINALS 

Several  types  of  terminals  can  be  distinguished  with  respect  to  Information  presentation.  Fig.  3  gives 
further  details  of  the  posslhl 1 Itiee  of  typical  computer  terminals  ss  well  ae  more  conventional  devices. 

A  division  can  he  made  hetween  1. auditive  Information  output;  bells,  butzera,  sirens,  etc.  end 

3.  visual  information  output;  a)  signal  lights,  Instruments,  dials,  etc. 

b)  symbol*,  graphs,  etc. 

At  the  end  of  the  paper  some  remarks  on  auditive  Input  and  output  will  be  glvsn.  However  the  main  pert 
will  be  a  discussion  on  typical  visual  aspects. 


Fig.  3  Kinds  or  terminal  output 
CRT  =  Cathode  Ray  Tuba 
LED  =  Light  Emitting  Dloda 
LCD  =  Liquid  Crlatal  Dlaplay 


VISUAL  ASPECTS 

tt  la  poaalbla  to  baaa  rocoamandatlona  and  rulas  of  thumb  tor  visual  information  praaantatlon  on  a 
description  of  the  visual  fiald  capabilities  of  the  human  observer. 

An  Image  of  the  environment  is  present  on  the  eye’s  retina.  The  characteristics  of  the  eye.  Including  the 
retina  and  the  nervous  aystem  behind  the  retina,  define  detection  and  recognition  of  objects  In  the 
visual  field. 

Contain.  »nd  luaJjansit 

The  perception  of  objects  depend  on  their  luminances,  but  within  certain  limits  the  contrast  <=  luminance 
ratio)  between  the  different  parts  of  the  field  of  view  defines  the  probability  of  perception. 


A  rule  of  thumb  can  be  given: 
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Table  I  Contrasts 


L10W 

Subjective  contrast  Impression 

1 

no  contrast 

1-3 

low  contrast 

3-10 

medium,  well  visible,  contrast 

>10 

(too)  high  contrast 

Normal  contrasts  are  all  In  ths  range  3  -  10.  Examples:  In  a  newspaper  between  white  paper  and  black 
printing  Ink,  7;  high  quality  printing  on  high  quality  white  paper,  10;  lead  pencil  on  white  paper,  3; 
symbols  on  average  CRT,  3  -  100:  that  often  weans  too  high. 

The  luminance  of  self-luminous  surfaces  Is  almost  indspend  of  the  Illumination  of  the  work  station, 
in  these  conditions  high  contrasts  way  exist. 

The  luminance  of  externally  lighted  surfaces  depends  on  the  Illumination  of  the  work  station;  the 
contrasts  are  mostly  less  high  than  for  self-luminous  surfaces. 

Specular  reflections 

In  a  window  pane  or  In  a  glossy  surface  specular  reflections  can  cause  glare.  The  Image  of  the  lighting 
fitting  or  other  objects  with  high  luminances  is  so  strongly  visible  that  It  causes  a  certain  annoyance 
with  respect  to  the  proper  object  behind  the  pane  or  under  the  glossy  layer. 

When  the  contrast  between  proper  object  and  the  annoying  image  is  greater  than  3-10  the  annoying  glare 
will  be  only  slight  or  negligible. 

Some  Information  presentation  madia  are  so  prone  to  specular  glare  that  one  would  better  not  use  them. 

Visual  acuity  and  Site  of  detail 

The  human  eye  Is  able  to  see  an  object  having  an  angular  slee  of  1  mlnutsof  arc.  However  rather  many 
people  have  lower  visual  acuity  because  of  not  using  their  glasses,  refusing  to  wear  glasses  or  not 
knowing  that  they  can  get  a  better  visual  acuity. 

So  for  practical  use  It  Is  better  to  base  the  Information  presentation  on  a  minimum  critical  detail  of 
2  minutes  of  arc.  This  means  a  detail  of  1  mm  at  a  viewing  distance  of  about  1.3  m. 

The  height  of  symbols  (letters  and  figures)  must  have  at  least  3  critical  details  and  In  general  the 
width  must  have  3  critical  details.  Thus  the  ratio  shall  be  at  least  0.6.  In  oblique  viewing 

a  greater  or  smaller  ratio  is  recommended  dependent  on  the  direction  of  viewing. 

The  legibility  of  a  symbol  increases  when  it  is  built  with  mors  details  and  whan  tha  details  itself 
are  not  too  large.  Some  presentation  media,  like  CRT's  ate.,  build  thalr  symbols  with  such  large 
details  that  the  legibility  is  lass  than  the  optimal. 

A  normally  printed  symbol  can  be  seen  at  a  viewing  distance  of  300x  its  height.  For  a  5  x  7  point 
matrix  symbol  a  viewing  distance  of  150x  Its  height  Is  recesnsended. 


Tabla  II  gives  a  review  of  the  diffarant  Information  davleaa 
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CONVENTIONAL 

CONVENTION. 

AL  4-  NEW 

DIALS 

SIGNAL 

CRT 

LED 

LCD 

PLASMA 

PAPER 

LIGHTS 

colour 

whits 

on 

black 

black 

on 

white 

Internal 

light 

external 

light 

contrast  In  tusk 
1) 

contrast  task  - 

B 

1 

I 

■ 

0  - 

surrounding 

1) 

specular  reflec- 

H 

X  o  - 

0  - 

1 

■ 

■ 

tions  annoyance 
2) 

minimum  size  of 

1 

0 

I 

■ 

I 

X  0  - 

detail 

3) 

H 

■ 

X  0 

1 

■ 

X 

moans : 

i) 

high 

2)  such 

3) 

large 

0 

means : 

i) 

medium 

2)  sodium 

3) 

medium 

- 

naans: 

i) 

low 

2)  little 

31 

small 

Accommodation  range  of  oldtr  people 


In  Kig .  4  the  ranges  of  accomodation  aa  they  ara  avallabla  for  paopla  of  diffarant  ages  ara  drawn. 
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Horizontal ]y,  the  vl awing  distances  art  given;  along  the  vertloal  axle, the  different  age  group* . 
For  such  age  group  horizontal  shaded  bands  Indicate  on  which  viewing  distances  focussing  by  Mans 
of  glasses  Is  posalble.  The  glasses  diopters  are  written  next  to  the  bands. 

Higher  ages  require  higher  diopter  glasses  correction  to  focus  at  ahort  distances  and  this  Mans 
a  decrease  of  the  viewing  distance  range  on  which  it  la  possible  to  focus  with  Just  one  diopter 
correction. 

Warning:  This  figure  Is  no  glasses  prescription.  The  data  given  are  Man  values  and  cannot  be 
applied  for  Individual  persona. 


OTHER  TERMINAL  ASPECTS 

Working  posture 

In  Many  work  stations  bad  working  postures  are  encouraged. 

Recosanendatlons:  Keyboards  Shalt  be  as  thin  as  possible  (wax.  thickness  2.5  cm). 

Keyboards  shall  ha  placed  as  low  as  possible. 

Screens  shall  be  placed  on  a  height  corresponding  to  a  line  of  sight  of  15  degrees 
below  the  horizon. 


Audi?.  «»Pg£tg 

Audio  Information  output  Is  easier  to  perceive  than  visual  output.  However  tt  attracke  the  attention 
of  the  operator  In  such  an  Intrusive  way  that  oudlo  output  Is  sostly  restricted  to  alarm  situations. 
Future  developaients  give  reason  to  think  that  voice  Input  and  voice  output  way  have  a  greater 
application  In  future  automation  processes. 


The  Use  of  Small  Computers  in  Information  Retrieval 
from  Textual  and  Free-Structured  Data-Bases. 


J.  H.  Ashford,  BSc,  PhD,  FGS,  MllnfSc. 

J.  H.  Ashford  Associates  Limited 

72,  Harrow  Lane,  Maidenhead,  SL6  7PA 


Summary  Minicomputer  systems  are  well  established  for  the  storage  and  retrieval  of  bibliographic 
records  and  the  full  text  of  documents.  Examples  are  given  of  a  range  of  applications 
including  the  handling  of  numeric  and  experimental  data.  The  relationship  between  free  text  and  structured 
records  is  examined,  and  the  appearance  of  hybrid  data  forms  is  noted. 

The  potential  for  the  use  of  microprocessor  based  systems  is  not  yet  well  developed,  and  is  limited  by  the 
relatively  high  cost  of  storage  devices,  and  by  the  difficulty  of  providing  flexible  and  reliable  software.  The 
most  promising  developments  are  following  the  minicomputer  systems  in  the  use  of  'package'  software, 
and  in  having  close  links  to  word  processing  systems. 

Future  developments  are  likely  to  centre  on  reducing  costs  of  storage;  substituting  associative  processors 
for  some  of  the  functions  now  performed  by  software;  development  of  user  interfaces,  and  new  methods  of 
search  planning  and  optimisation.  The  use  of  communications  links,  both  locally  and  over  long  distances  will 
be  of  increasing  importance  as  'electronic  publishing1  becomes  accepted  for  a  wide  range  of  information. 


The  position  in  1981  A  survey  was  made  in  1978  -  79  of  the  potential  market  for  package  free  text  storage 
and  retrieval  systems  in  UK;  this  was  published  in  summary  (l)  and  later  more 
fully  (2).  One  of  the  main  conclusions  of  the  study  was  that  '25 %  of  respondents  expected  to  be  using  some 
form  of  free  text  information  system  by  end  1981' .  The  main  application  areas  identified  were  bibliographic 
records;  full  text  of  technical  documents;  commercial  information;  and  special  applications  such  as  legal 
case  studies,  and  accident  and  incident  databases.  With  the  benefit  of  hindsight,  it  can  now  be  seen  that 
while  the  survey  correctly  identified  the  high  rate  of  new  interest  in  text  handling  systems,  it  very  much 
underestimated  the  importance  of  word  processing  developments  and  the  significance  of  equipment  acquired 
for  document  preparation  as  a  minimal  cost  route  to  database  acquisition  for  local  systems.  When  the 
studies  of  potential  markets  for  packages  were  extended  to  North  America ,  a  new  and  important  application 
was  identified  -  the  build  up  and  exploitation  of  document  databases  in  support  of  litigation  in  major  civil 
proceedings.  The  main  application  areas  of  free  text  systems  for  information  storage  and  retrieval  on 
small  computers  are  now  believed  to  be: 

Bibliographic  records 

Catalogues  of  special  libraries  * 

Catalogues  of  report  and  document  collections  * 

Bibliographic  records  and  abstracts  for  abstracts  journals  etc.  and  related 
information  services  * 

Full  text  systems 

Legal  statute,  cases  and  related  documents 
Documents  in  support  of  litigation  * 

Technical  reports,  patents  etc.  * 

Laboratory  and  clinical  records  in  pharmaceutical  development  and  a 
number  of  similar  'health  and  safety'  applications 
Special  systems 

Special  library  circulation  control  systems  * 

Hazardous  chemicals  databases 

Commercial  records  systems  -  markets,  prospects,  proposals,  invoices, 
servicing  reports  etc.  * 

The  applications  marked  *  above  are  particularly  developed  on  minicomputers!  most  of  the  others,  with 
the  exception  of  the  very  large  shared  legal  databases,  are  potential  small  machine  systems.  A  selection 
of  operational  systems  is  listed  in  Appendix  'A'  to  illustrate  the  range  of  applications,  hardware  and 
software  which  have  been  successfully  implemented.  The  online  information  vendors,  offering  'dial-in' 
access  to  large  bibliographic  databases  have  not  in  general  used  minicomputers;  there  is,  however  at 
least  one  'financial  information'  service,  the  TEXTLINE  system  operating  in  the  City  of  London,  which  runs 
on  a  DEC  PDP  11/70  minicomputer. 


At  the  time  of  writing  -  June  1981  -  very  few  microcomputer  based  storage  and  retrieval  systems  for  text 
Information  have  progressed  beyond  small  scale  or  experimental  applications.  Those  of  which  the  writer  is 
aware  are  discussed  later  under  current  developments. 


10-2 


A  useful  distinction  can  be  made  between  the  textual  and  free  structured  databases,  with  which  this  account 
is  mainly  concerned,  and  the  formally  structured  database  systems  used  largely  in  commercial  and 
'numeric'  data  processing,  based  on  the  way  in  which  stored  records  are  indexed  and  accessed  by  the  user. 
The  usual  approach  taken  with  free  text  systems  is  to  form  an  inverted  file  of  all  (or  most)  of  the  'words' 
found  in  the  text.  This  inverted  file,  or  concordance,  then  points  back  to  the  source  articles  from  which  each 
word  was  drawn,  often  at  a  level  of  'chapter;  article;  paragraph;  word' ,  and  is  used  to  support  the  processing 
of  boolean  query  statements  containing  one  or  more  words  as  arguments.  This  method  allows  reference  to 
the  database  to  be  made  on  the  content  of  the  articles,  treating  all  words  as  tokens  of  equal  status.  The 
structured  database  handler ,  on  the  other  hand,  uses  a  predetermined  format  to  allow  definition  -_f  the  role 
of  a  field,  such  as  ' Department'  or  'Cost  Centre' ,  so  that  processes  such  as  listing  all  Departments  in 
sequence  of  Department  Number  are  naturally  facilitated.  (This  subject  is  developed  at  length  in  (3)  and  (4)). 

The  simplest  form  of  free  text  record  is  represented  by  a  wholly  unstructured  'article'  of  continuous  text. 

This  is  rarely  used,  but  the  next  stage,  the  Title  plus  Abstract  form  is  very  common.  Here  only  the  Title 
field  is  role  defined;  all  other  matter ,  including  author ,  source  etc.  is  treated  as  continuous  text  with  the 
abstract.  In  a  bibliographic  record  -  as  used  in  a  catalogue  in  a  special  library,  for  example  -  it  may  be 
desirable  to  distinguish  words  used  in  the  separate  contexts  of  Title;  Authors;  Classmark;  Location;  Date; 
Volume;  Publisher;  Pagination  and  format;  Current  borrower;  Reservers.  These  role  subdivisions  of  free 
text  records  are  achieved  by  setting  up  a  loose  structure  of  named  sections,  where  the  section  is  used  as  a 
limiting  'window'  to  narrow  the  access  to  part' only  of  the  data.  Few  free  text  applications  use  records 
so  much  subdivided  as  the  MARC  International  Exchange  Format  for  catalogue  data ;  however  it  is  becoming 
usual  to  find  five  or  six  sections  in  bibliographic  records;  five  or  more  in  commercial  record  systems; 
up  to  ten  in  personnel  records.  Thus  the  free  text  database  handler  has  moved  towards  the  role  oriented 
approach  of  the  structured  database,  although  without  gaining  the  flexibility  of  amendment  associated  with 
preformatted  'pigeon-hole'  like  structures. 

A  recent  and  interesting  response  to  the  need  for  partial  structure  combined  with  flexibility  in  amendment  of 
rapidly  changing  data  fields  has  been  the  development  of  hybrid  databases,  where  part  of  the  data  is  held  on 
a  free  text  file,  and  accessed  by  content,  and  part  is  held  as  a  series  of  formatted  records  in  one-to-one 
correspondence  on  a  conventional  indexed  sequential  file.  This  combination  is  seen  when,  for  example, 
circulation  control,  with  its  many  small  transactions,  is  added  to  the  handling  of  a  lihrary  catalogue  on 
a  free  text  system.  The  link  between  the  two  parts  of  the  hybrid  is  sometimes  made  directly  within  the 
software  and  is  not  under  control  of  the  user,  sometimes  through  a  visible  connecting  reference  number 
contained  in  both  parts  of  the  record  and  treated  as  a  user  option.  Other  hybrid  systems  of  this  sort  are 
beginning  to  be  used  in  handling  of  membership  record  of  societies,  and  in  the  coupling  of  maintenance 
records  to  textual  databases  describing  plant  and  equipment. 

For  textual  databases  of  medium  scale,  say  ten  million  to  one  thousand  million  characters  of  stored 
information,  the  use  of  minicomputer  systems  is  well  established.  Many  of  these  systems  use  software 
package  products,  alone  or  with  local  adaptations,  as  the  nucleus  of  the  operation.  With  the  exception  of 
academic  institutions,  where  software  is  often  written  from  first  steps  which  could  readily  have  been  had  from 
outside  vendors,  the  package  approach  to  database  software  is  gaining  increasing  acceptance,  not  only  for 
the  cost  savings  it  otters,  but  for  the  assurance  of  reliability  and  adequate  documentation  and  maintenance. 

In  the  course  of  discussion  at  a  recent  conference  (5),  over  twenty  free  text  retrieval  systems  were 
identified  which  had  been  used  successfully  on  a  number  of  sites.  About  one  third  were  available  on  mini¬ 
computers,  but  the  proportion  of  recently  developed  systems  using  minicomputers  was  much  higher. 

Small  databases,  on  the  other  hand,  are  being  treated  in  all  sorts  of  ways.  Some  have  been  fitted  into 
relational  database  handlers  (such  as  INFO,  RAPPORT);  others  are  kept  on  mainframe  systems  using  free 
text  software;  still  others  are  managed  on  bureau  services.  However,  one  of  the  most  rapid  recent  develop¬ 
ments  has  been  the  adaptation  of  word  processors  to  act  as  storage  and  retrieval  systems  of  varying  sizes 
and  degrees  of  sophisti  cation.  A  detailed  account  of  a  carefully  thought  out  series  of  applications  describes 
word  processors  in  an  industrial  environment  (Pilkington  Glass  Limited)  to  provide  both  local  simple  data¬ 
base  systems,  and  a  facility  for  information  transfer  between  a  number  of  widely  scattered  sites.  (6). 

Current  developments  Three  main  areas  of  development  activity  are  running  in  parallel;  the  tidying  up  and 
consolidation  of  the  'main  stream'  free  text  package  systems,  both  on  mainframes 
and  minicomputers;  the  development  of  preprocessing  and  postprocessing  utilities  to  make  the  storage  and 
retrieval  systems  easier  to  use;  and  an  epidemic  of  microprocessor  developments  of  all  sorts,  many  of 
very  doubtful  value,  but  some  likely  to  be  the  stem  for  an  important  range  of  low  cost,  local  or  personal 
database  processors.  The  availability  of  cheap  microprocessor  hardware,  and  relatively  cheap  'floppy' 
disc  storage  up  to  perhaps  two  million  characters  online  at  a  time  is  the  main  current  stimulant  from  the 
equipment  standpoint.  Software  development  is  concerned  more  with  marginal  improvements  and  utility 
programs  than  with  radical  innovations,  at  least  on  a  scale  of  two  years  ahead. 

The  consolidation  of  existing  technology  is  well  illustrated  by  two  of  the  package  systems  available,  or  to  be 
available  on  minicomputers.  The  first,  STATUS,  has  been  in  use  since  1970,  and  in  its  considerably  evolved 
present  form  since  1980.  The  current  version,  STATUS  80  is  regarded  by  the  development  group  as  having 
reached  a  reasonable  limit  to  its  range  of  features,  and  work  on  the  central  system  -  as  opposed  to  its 
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ancillary  and  auxiliary  programs  -  is  concentrating  on  further  gains  in  run  time  efficiency,  better  use  of 
storage  space,  and  adaptation  to  more  machines  and  wider  markets.  SIFT,  on  the  other  hand,  is  a  new  (7) 
package  system  being  developed  by  the  Statens  rasjonaliseringsdirektorat  in  Oslo,  Norway,  to  provide  a 
software  vehicle  for  legal  databases  in  the  first  instance.  SIFT  is  a  fairly  advanced  derivative  of  an  early 
version  of  STATUS,  but  has  undergone  a  good  deal  of  local  revision  and  development.  The  point  of  interest 
for  present  purposes  is  that  in  this  new  initiative,  there  are  no  significant  features  not  already  available 
in  STATUS  80  and  its  auxiliary  software.  (Nor,  incidentally,  is  this  true  only  of  STATUS;  there  are  several 
package  systems  available  with  essentially  the  same  functional  coverage  -  ASSASSIN,  CAOtS,  STAIRS, 
UNIDAS,  for  example  available  in  Europe,  and  BASIS  and  STAIRS  in  North  America).  Most  of  the  systems 
under  active  development  are  becoming  easier  to  use  and  better  organised  -  but  no  major  new  features 
are  appearing.  The  technology  is,  therefore ,  now  stable  so  far  as  the  central  software  is  concerned,  and 
will  probably  not  change  much  until  new  hardware  developments  change  the  operational  context. 

There  is,  however,  a  great  deal  of  work  being  put  in  to  software  auxiliary  systems  to  link  the  user  to  the 
system  in  such  a  way  as  to  make  it  easy  to  use,  or  robust  to  errors  in  data,  or  generally  more  presentable. 
Most  package  systems,  and  many  locally  developed  systems  nowadays  are  aiming  to  provide: 

Input  format  aid,  allowing  'form-filling1  input  of  text  records; 

Input  of  word  processor  texts; 

Input  from  computer  typesetting  tapes;  (not  so  common) 

Output  in  tabular  format; 

Output  to  word  processor; 

Output  to  typesetter; 

Automatic,  or  semi-automatic  index  generation; 

Controlled  language  for  keyword  selection; 

Thesaurus  handler  and  links  into  searching. 

In  addition  one  might  reasonable  expect  to  find,  either  available  or  under  development,  routines  for  the 
efficient  bulk  take  on  of  records  from  external  sources;  query  reorganisation  (unseen  to  the  user)  to 
minimise  search  times  and  comprehensive  security  and  password  control.  Some  systems,  such  as  SIFT 
and  STATUS  now  incorporate  macroprocessor  facilities,  and  in  the  latter,  the  macroprocessor  permits 
the  handling  of  conditional  processes  and  run  time  specialisation,  so  that  completely  new  'verbs'  can  be 
added  to  the  command  language  for  particular  users. 

Now  that  a  microcomputer  with  minimal  storage  on  floppy  discs  can  be  bought  for  less  than  £500.  and  the 
output  fed  to  a  domestic  television  screen,  there  is  no  obstacle  of  any  significance  to  the  do-it-yourself 
programmer  except  the  intrinsic  difficulty  of  writing  accurate,  reliable,  maintainable  and  functionally  well 
conceived  software.  Since  the  difficulties  are  not  obvious  to  the  beginner ,  and  since  the  satisfaction  and 
excitement  of  getting  on^s  first  jobs  to  run  are  considerable,  there  are  many  thousands  of  man-hours  now 
being  spent  on  learning  to  program,  and  on  producing  amateur  software  of  poor  quality  and  limited  utility. 
The  minimum  of  tasks  to  be  undertaken  in  any  program  development  includes: 

Definition  of  functional  requirement; 

Simulation  check  (desk  check)  that  the  functional  specification  is  reasonable; 

Design  of  data  structure,  taking  account  of  storage  efficiency,  access  efficiency, 
and,  in  the  case  of  floppy  discs,  minimising  wear  during  read  or  write  cycles; 

Design  of  program  structure; 

Writing  of  program  code  -  preferably  in  a  language  with  inherent  structural  features 
such  as  PASCAL  or  ALGOL  or  a  structured  version  of  FORTRAN; 

Testing  with  synthetic  data ; 

Testing  with  live  data ; 

Exhaustive  testing  of  error  and  exception  cases; 

Provision  for  security  backup  of  the  data ; 

Documentation  of  the  system  - 
for  the  user ; 

for  the  programmer  who  will  one  day  have  to  maintain  it! 

The  writer  is  concerned  that  a  major  proportion  of  the  microcomputer  information  systems  now  being 
written  and  proudly  demonstrated  will  not  survive  the  promotion  or  transfer  of  their  authors.  All  this  is 
not  to  say  that  one  should  not  learn  to  program  -  but  that  software  intended  for  regular  use  in  situations 
where  its  performance  matters,  requires  professional  attention. 

There  are  sound  and  well  conceived  microprocessor  developments.  The  example  cited  above  of  Pilkington '  a 
use  of  word  processors  merits  study  as  a  good  case  of  careful  requirement  specification  carried  through 
into  practice  with  skill  and  judgement (6).  Lundeen,  in  a  recent  paper,  (9),  cites  a  number  of  systems 
installed  in  libraries  for  cataloguing,  circulation  control  and  other  applications  which  have  been  set  up  by 
software  houses  of  good  reputation.  At  least  two  fairly  comprehensive  free  text  storage  and  retrieval 
packages  running  on  microprocessors  will  soon  be  available  from  teams  with  established  reputations  - 
one  from  Cuadra  Associates  of  Santa  Monica,  Ca. ,  and  the  other  from  the  ASSASSIN  team  at  ICI  Agricultural 
Division  in  Billingham,  UK.  The  best  policy  for  a  little  while  is  probably  to  'wait  and  see'  and  to  greet 
innovations  on  microprocessors  with  reasonable  scepticism  for  the  time  being. 


The  future  with  small  computers  The  best  method  of  forecasting  developments  in  the  computer  industry 

appears  to  be  to  keep  a  list  of  those  features  one  most  wants  which  are 
just  at  the  edge  of  credibility  for  seven  years  hence  -  and  confidently  expect  them  to  turn  up  within  three 
years  if  enough  other  people  want  them  too.  (This  does  not  work  for  compiler  translation  of  natural  lang¬ 
uages;  the  travelling  salesman  problem;  or  a  high  level  language  so  simple  that  the  end  user  can  specify 
his  own  applications  without  the  need  for  DP  types!  These  are  special  cases. )  The  following  assertions 
must  be  taken  to  represent  a  mix  of  intelligent  guesswork  and  'wish  list'  for  this  writer. 

The  really  useful  hardware  developments  will  be: 

Cheap  mass  storage,  possibly  solid  state,  possible  ever  cheaper  discs  under  competition  from  solid 
state  devices.  This  will  allow  us  to  store  very  large  databases  at  reasonable  costs,  for  instance  in 
the  case  of  the  ftill  text  of  the  proceedings  of  legislative  assemblies  -  which  are  at  present  desirable 
but  very  costly.  It  will  also,  at  the  other  end  of  the  scale,  make  individual  or  local  systems  more 
viable  economically. 

Large  scale,  read-only  data  stores  such  as  video  discs  will  alter  the  economics  of  publishing,  possibly 
making  it  more  attractive  to  buy  a  database  than  to  dial  an  online  data  vendor.  Text  and  graphics  will 
then  become  easier  to  interleave  and  electronic  publishing  might  start  to  approach  the  aesthetic 
quality  of  traditional  methods. 

Hardware  modules  will  replace  heavily  used  software,  including  the  embedding  of  parts  of  the  retrieval 
programs  in  'microcode'  or  equivalent;  the  use  of  associative  stores  to  take  over  list  or  index 
searching  tasks;  and  communications  controllers  such  as  X25  handlers  embedded  in  hardware  logic. 

All  of  the  above  developments  require  fairly  large  numbers  of  users  to  want  to  do  much  the  same  sort  of 
thing  with  their  equipment.  This  is  a  normal  feature  of  hardware  innovation,  and  the  buildup  of  a  critical 
mass  of  users  to  make  an  innovation  economic  sometimes  makes  it  look  as  though  new  ideas  come  in  more 
of  a  rush  than  is  actually  the  case. 

Software  innovations  are  much  more  of  a  problem  to  anticipate,  partly  because  software  does  not  have  the 
investment  lead  time  involved  in  mass  producing  cheap  hardware,  and  partly  because  at  any  one  time,  a 
large  number  of  researchers  are  pursuing  parallel  paths  in  different  contexts,  and  it  is  the  coincidence  of 
a  new  theoretical  construct  with  a  clear  application  which  often  appears  to  set  off  the  fast  exploitation. 

(Past  examples  might  include  linear  programming,  critical  path  analysis,  and  indexed  sequential  file 
handlers.  On  the  other  hand,  relation  database  methods  and  word  processing  hung  around  for  a  long  time 
before  a  lot  of  people  suddenly  took  an  interest. )  Consider  as  possibilities: 

Sympathetic  user  interfaces  of  several  types  are  being  developed.  These  include  'standard'  terminals 
which  will  allow  a  user  to  communicate  with  a  number  of  disparate  information  systems  in  a  single 
language;  interface  processors  for  data  centres  to  allow  the  use  of  a  common  command  language  by 
many  users;  intelligent  terminals  to  make  the  tasks  of  searching  less  tedious  by  storing  protocols 
and  standard  inquiries  for  regular  use.  References  (10,11) 

Research  is  going  on  in  several  centres  on  the  use  of  morphological  analysis  in  framing  and 
improving  queries,  and  in  guiding  their  subsequent  processing.  This  approach  consists  in 
essence  of  using  the  form  and  syntax  of  natural  language  problem  statements  to  derive  suitable 
search  frames,  without  taking  account  of  the  semantic  content  of  the  material  presented.  ( 12, 13). 

A  number  of  workers  are  engaged  in  studies  of  automatic  subject  classification  by  clustering  of 
numbers  of  terms  occur ing  in  natural  language  documents.  This  work  has  several  branches,  and 
has  been  going  on  for  some  years,  but  now  there  are  real  improvements  in  computing  efficiency, 
which  has  been  a  stumbling  bloclg  and  the  promise  of  results  on  a  practical  scale.  (14;  includes 
review  and  bibliography). 

The  automatic  analysis  of  texts  and  development  of  '  inferential'  structures  has  now  reached  the  stage 
where  systems  employing  these  techniques  are  beginning  to  offer  real  promise  as  analysers  of  the 
information  implicit  in  large  collections  of  documents,  where  each  document  in  itself  offers  only 
scraps  of  data,  more  or  less  meaningless  in  Isolation.  Applications  are  envisaged  in  diagnostic 
systems;  analysis  of  hospital  oatient  records;  automatic  scanning  and  interpretation  of  broadcast 
monitoring  data.  (15,16). 

. and  as  a  promising  'long  shot' ,  note  the  ASK  Project.  (17).  In  this  study,  the  basic  approach 

is  to  assume  that  a  questioner  is  a  questioner  because  he  has  an  'anomalous  state  of  knowledge'  in 
which  anomalies  are  to  be  satisfied  by  retrieval  of  information  from  a  database.  This  apparently 
oblique  attack  has  resulted  in  a  number  of  insights  into  how  queries  can  be  formulated,  taking  into 
account  semantic  associations  represented  by  clustering  methods,  and  shows  signs  of  potential 
performance  better  than  that  of  conventional  'best  match'  retrieval  systems.  Complementary 
studies  on  retrieval  system  performance  may  prove  to  be  of  value  for  some  years  to  come,  ss  new 
approaches  to  searching  and  classification  are  tried  out.  ( 18) 

Finally  -  and  possibly  most  important  of  all  on  a  medium  timescale  -  the  availability  of  microcomputer 
based  communications  centres  with  local  storage  and  the  facility  for  calling  on  remote  beck  up  stores  when 
needed,  may  well  break  up  the  present  tendency  to  centralisation  of  information  units,  and  lead  to  a 
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redistribution  of  document  storage  to  the  sites  where  it  is  most  used.  The  valuable  concept  of  '  shared 
investment '  in  information,  which  has  lead  directly  or  indirectly  to  such  services  as  the  Library  of 
Congress  databases,  BLAISE,  OCLC,  DIALOG  and  many  others,  will  still  be  retained,  but  users  will  be  enabled 
to  draw  out  their  sections  of  interest  for  local  manipulation ,  and  will  incur  network  costs  and  the  occasional 
messiness  of  on  line  working  only -when  they  need  to  refer  to  exceptional  material. 


Maidenhead;  June  1981 
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Appendix  A  Selected  list  of  applications  on  minicomputer  systems. 

-  These  examples  are  drawn  from  the  list  of  users  of  STATUS  published  by  AERE  Harwell; 

The  author  is  aware  of  parallel  applications  on  CA1RS,  FACTFINDER,  3RIP  and  others,  but 
their  vendors  do  not  publish  convenient  lists  of  their  clients! 

Location  Hardware  Main  applications  Notes 


BNF  Metals  Technology  Ltd  Prime 

Building  Research  Establishment 

Prime 

Finsbury  Data  Services  Ltd  DEC 

Police  Scientific  Dev.  Branch  GEC 
Orion  Corp.  Ltd  DEC 

Ricardo  Consulting  Engineers  ICL 

Rutherford  Laboratory  (SRC)  Prime 
UKAEA  Culham  Laboratory  Prime 


Abstracts  database 
Company  Information 
Library  catalogue 
Library  loans 
Financial  and  industrial 
information  service 
Document  retrieval 
Product  development  database 
Technical  abstracts 
Engine  database 

Engineering  drawing  specifications 
Fusion  experimental  data 


Over  60,000  records 

On  line  access  to 
subscribers 
(Home  Office) 


(Reproduced  by  permission,  AERE  Harwell  Laboratories). 
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Addendum 
(September  1981) 

Following  Is  a  list  of  some  of  the  companies  who  have  Installed  BFMIRA  CAIRS,  their  main 
applications,  etc: 


COMPANY 

APPLICATIONS 

HARDWARE  IN  USE 

SCALE  OF  DB 

British 

Aerospace 

IR/LM/CA/PI/ 

BG/WP 

Texas  990/10 
running  DX10 

11  databases: 
up  to  100,000 
records 

Smith  Kline 
a  French 
(UK  &  USA) 

IR/LM/CA/PI/ 

BG/PD/WP 

PDP  11/70 
running  RSTS 

13  databases: 
up  to  30,000 
records 

Corn  Products 
Corporation 
(USA  a  Belg) 

IR/LM/PI 

Texas  990/10 
running  DX10 

3  databases: 
up  to  12,000 
records 

Agricultural 

Research 

Council 

IR/CA 

Texas  990/12 
running  DX10 

1  database: 
up  to  4,000 
records 

Rio  Tinto 

Zinc  Corp 

IR/CA 

PDP  11/60 
running  RSTS 

1  database: 
up  to  10,000 
records 

British 

Petroleum 

IR/CA/PI/CT 

Texas  990/10 
running  DX10 

2  databases: 
up  to  7,000 
records 

Manpower 

Services 

Commission 

IR 

Perkin  Elmer 

3200 

1  database: 
up  to  10,000 
records 

Unilever 

(Australia) 

IR/LM/CA/PI 

CT/BG 

PDP  11/34 
running  RSX  11M 

1  database: 
up  to  50,000 
records 

Berec  Group 

Ltd 

IR 

Texas  990/10 
running  DXlo 

1  database: 
up  to  5,000 
records 

Chloride 
Technical  Ltd 

IR/LM 

Texas  990/10 
running  DXlo 

1  database: 
up  to  5,000 
records 

Glaxo  Ltd 

IR/LM/PI/BG/ 

CA 

Prime  750 

9  databases: 
up  to  100,000 
records 

KEY:  IR  -  Information  retrieval 
CA  =  Current  awareness 
CT  «  Catalogues 
WP  «  Word  processing 


LM  =  Library  management 
PI  -  Printed  indexes 
BG  =  Bulletins 
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ONE  YEAR  OF  EURONET  DIANE 
EXPERIENCE  AND  EXPECTATIONS 

BARRY  MAHON,  Head  Euronet  Launch  Tear 
15,  avenue  de  la  Faience rie,  Luxembourg. 


SUMMARY 

The  Euronet  DIANE  network,  the  information  facility  instigated  aid  developed  by  the  Conmission  of 
the  European  Conrnunities  commenced  commercial  service  on  31st  March  1980. 

This  paper  reviews  the  first  years  operations.  It  describes  the  telecommunications  network  aid  in 
particular  the  relationship  between  the  European  Commission  and  the  consortium  of  eleven  PIT's  who  provi¬ 
de  the  technical  facilities.  The  evolution  of  Euronet  Into  a  public  packet  switched  network  for  all  the 
European  Community  Member  States  and  also  for  countries  outside  the  Communities,  is  also  described. 

Concerning  the  information  services,  collectively  referred  to  as  DIANE,  the  paper  describes  the  ex¬ 
pansion  that  has  taken  place  in  files  available  since  commercial  opening  and  analyses  the  progressive  shift 
in  emphasis  from  purely  bibliographic  services  to  a  mix  of  factual  data  banks  and  bibliographic  data  bases. 

The  ancillary  facilities,  referral,  document  ordering  and  delivery  etc.  either  operational  or  under 
development  are  also  described  in  particular  the  activities  of  the  European  Communities  in  developing  the 
Ccmnon  Conmand  Language  and  the  programme  of  work  designed  to  overcome  the  Language  barrier. 

The  paper  discusses  experience  of  the  Conmission  and  the  Member  States  in  the  development  of  new 
information  services,  the  integration  of  new  forms  of  interactive  information  services  (Viewdata)  and  the 
contribution  that  Euronet  DIANE  has  made  towards  the  evolving  "telematique"  society. 


INTRODUCTION 

In  order  to  understand  the  origins  of  a  project  such  as  Euronet  DIANE  it  is  necessary  to  appreciate 
in  outline  the  administrative  structure  of  the  European  Economic  Community.  The  Ccmranity  has  its  legal 
basis  in  the  Treaty  of  Rome  signed  by  the  original  six  Menfcer  States,  acceded  to  by  United  Kingdom,  Den¬ 
mark  and  Ireland  in  1973  and  by  Greece  in  1981. 

Under  the  Treaty  the  Commission  of  the  European  Conmunities  was  set  up  to  provide  the  day  to  day- 
executive,  administrative  and  secretarial  role;  as  such,  it  is  an  international  civil  service.  The  Conmis¬ 
sion  may  propose  policies  and  projects  to  the  Member  States  but  may  not  initiate  major  Community  policies 
in  its  own  rigut.  Such  was  the  case  with  the  policy  basis  for  Euronet  DIANE.  In  1971  the  Council  of  Minis¬ 
ters,  the  representatives  of  the  governments  of  the  Menfcer  States,  approved  a  series  of  resolutions,  which 
had  been  proposed  earlier  by  the  ‘ommission,  the  purpose  of  vhich  were  to  create  a  network  for  the  collec¬ 
tion  and  dissemination  of  Scientific  and  Technical  Information  (STI)  in  the  European  Comrxnity. 

In  these  resolutions  neither  the  form  of  the  network  nor  STI  was  defined  explicitly.  However,  in 
the  resolutions  the  Council  created  a  committee  to  advise  the  Conmission  in  carrying  out  the  work.  This 
Comnittee  was  subsequently  called  CIDST  (Committee  for  Information  and  Documentation  in  Science  and  Tech¬ 
nology)  and  consists  of  representatives  nominated  by  the  Menfcer  States. 

This  brief  historical  background  is  necessary  because  it  is  often  erroneously  suggested  that  the 
Conmission  has  had  a  dictatorial  role  in  the  development  of  Euronet  DIANE.  The  truth  is  that  the  Conmis¬ 
sion' s  role  has  been  to  find  a  measure  of  agreement  between  the  Menfcer  States  in  order  to  carry  out  the 
terms  of  the  1971  resolutions. 


EURONET  DIANE 
one  year  of  operations 

The  telecommunications  network  become  available  for  'pilot'  operations  on  15  November  1979  and  was 
put  into  conmercial  service  on  31  March  1980. 

The  telecommunications  facilities  are  provided  under  contract  by  a  Consortium  of  the  PTTs  (Post  and 
Telecommunications)  of  the  participating  countries,  vhich  are  the  10  countries  of  the  European  Economic 
Conmunities  and  Switzerland.  A  PTT  management  committee  provides  the  policy  and  overall  supervisory  basis 
while  a  Network  Management  Centre,  located  in  London  provides  the  day  to  day  technical  management.  The 
Conmission  of  the  European  Conmunities  through  its  Directorate  General  XIII,  assisted  by  CIDST,  is  respon¬ 
sible  for  the  management  of  the  overall  project  (in  particular  relationships  with  the  'hosts')  and  provi¬ 
des  through  the  Euronet  Launch  Team  the  operations  management.  The  PTTs  are  responsible  for  technical  main¬ 
tenance  of  the  network  links  which  consists  of  5  switching  nodes  located  in  tendon,  Paris,  Rome,  Frankfurt 
and  Zurich  and  access  facilities  in  Brussels,  Luxembourg  (linked  to  Paris),  Dublin,  Amsterdam  (linked  to 
London)  and  Copenhagen  (linked  to  Frankfurt).  The  technology  of  the  network  is  packet  switching,  using  a  mo¬ 
dified  version  of  the  hardware  and  software  of  Transpac  the  French  national  packet  switching  data  network. 

This  separation  of  responsibilities  between  the  PTTs  and  the  CEC  was  established  from  the  outset  be¬ 
cause  the  Communities  did  not  envisage  any  long  term  involvement  in  provision  of  telecomnunlcations  links 
-  its  major  interest  being  the  provision  of  reliable  and  rapid  access  to  stores  of  information,  as  speci¬ 
fied  in  the  1971  resolution.  The  contract  between  the  CEC  and  the  PTT’s  allows  for  the  'takeover'  of  the 
telecommunications  by  the  PTTs  and  this  process  has  already  cormenced  and  will  accelerate  over  the  next  2/3 
years.  In  France,  for  example,  Euronet  has  never  really  existed,  its  functions  are  provided  by  Transpac 
*hich  is  linked  to  Euronet  at  Paris. 

In  the  near  fiiture  DATEX-P  will  provide  similar  facilities  in  Germany  and  PSS  in  the  IMited  Kingdom. 
These  developments  will  be  followed  by  national  data  communications  facilities  in  most  other  contries  so 
that  by  1984/85  the  links  provided  to-day  by  Euronet  will  be  superseded  by  various  interconnections  between 
national  facilities  in  Europe  and  for  access  to  other  areas  by  links  between  European  countries  and  USA, 
Japan,  the  Far  East  etc.  To  the  user  this  change  will  only  be  partly  apparent,  in  essence  he  or  she  will 
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use  their  present  national  PTT  to  provide  access  to  a  worldwide  data  communications  network  similar  to  the 
voice  network  we  use  to-day. 

In  the  period  of  slightly  over  one  year  that  Euronet  has  been  in  commercial  operation  almost  2000 
subscribers  have  signed  contracts  for  use.  They  have  access,  at  the  time  of  writing,  to  26  host  computers 
who  support  270  dat.  bases  and  data  banks  of  information.  The  evolution  of  these  services  in  the  first 
operational  year  illustrates  clearly  the  wealth  of  resources  which  Europe  contains  and  justifies  the  Com¬ 
munities  view  that  the  provision  of  Euronet  would  act  as  a  catalyst  in  the  creation  of  a  viable  informa¬ 
tion  industry  for  Europe.  On  ‘opening  day'  there  were  16  hosts  providing  96  files;  by  the  end  of  1980 
there  were  23  hosts  and  137  files  and  by  early  1982  it  is  envisaged  that  there  will  be  more  than  40  hosts 
and  over  300  files  available.  The  total  number  of  items  (bibliographic  references  or  factual  records) 
available  to-day  is  over  60  million  of  which  50-60%  are  European  in  origin. 

The  evolution  has  been  remarkable  not  only  for  the  sheer  size  of  the  number  of  items  searchable 
but  for  the  change  from  bibliographic  to  factual.  From  a  base  of  approximately  90:10  bibliographic:  fac¬ 
tual  in  early  1980  to-days  ratio  is  approximately  70:30.  This  change  is  particularly  apparent  in  the  sub¬ 
ject  area  of  business  and  economics  and  reflects  the  growing  interest  of  industry  in  accurate  and  easily 
accessible  production  and  financial  Information  for  decision  making.  Of  interest  also  is  the  geographic 
distribution  of  the  supplying  centres  (as  expected  by  end  1981)  shown  in  the  table  below: 


COUNTRY 

BELGIUM 

FRANCE 

LUXEMBOURG 

GERMANY  F.R. 

SWITZERLAND 

DENMARK 

ITALY 

NETHERLANDS 
UNITED  KINGDOM 


NUMBER  OF  HOSTS 

3 

11 

2 

6 

1 

2 

5 

2 

5 


SUPPORT  FACILITIES 

In  addition  to  the  continuous  support  provided  by  the  Euronet  DIANE  Launch  Team  for  users,  there 
are  a  number  of  other  services  and  user  aids  available.  The  biggest  and  most  pervasive  of  these  is  the  On¬ 
line  News  and  Information  service  available  free  of  charge  to  all  users.  This  facility,  provided  in  six 
languages,  contains  a  news  file  in  which  the  hosts  and  the  PTTs  can  inform  customers  of  operational  changes, 
holiday  periods  etc;  a  complete  list  of  the  hosts  and  data  bases/data  banks  available,  a  list  of  PTT  con¬ 
tact  points  and  details  of  technical  requirements  for  using  the  network.  The  service  is  continually  upda¬ 
ted  by  the  launch  Team  and  the  arrangements  are  such  that  an  urgent  news  item  can  be  mounted  in  six  lan¬ 
guages  within  one  hour  of  receipt.  Another  important  support  facility  is  the  Common  Command  Language  (CCL). 
It  was  realised  early  in  the  development  of  Euronet  DIANE  that,  faced  with  a  multiplicity  of  hosts  and 
consequently  wi th  a  number  of  retrieval  languages,  users  would  have  difficulty  in  'learning'  the  various 
procedures.  The  Commission  developed  in  association  with  a  number  of  hosts  a  set  of  commands  which  would 
be  'universal'  at  least  in  their  syntax  if  not  in  their  detailed  effect.  These  commands  can  be  implemented 
by  a  host  either  as  a  replacement  for  existing  commands  or  via  a  translation  device  mounted  on  his  machine. 
For  example  to  locate  a  keyword  and  display  the  number  of  occurances  the  CCL  uses  ’FIND1;  to  list  a  set 
of  keywords  for  selection  the  CCL  uses  'DISPLAY'  and  to  bring  up  retrieved  references  the  CCL  uses  ’SHCW’. 

Through  this  method  a  user  can  switch  from  one  host  using  CCL  to  another  and  perform  searches  using 
the  same  commands.  To-date  seven  hosts  have  implemented  CCL  and  a  further  7-10  will  provide  it  in  the  near 
future. 

Also  the  network  has  available  a  file  which  gives  details  of  compatible  terminals  for  Euronet,  the 
first  file  of  an  envisaged  all  embracing  referral  facility  which  will  provide  a  detailed  subject  guide  to 
the  available  information  files,  a  list  of  document  supply  centres,  details  of  forthcoming  training  eour- 
ses  and  possibly  an  electronic  nail  box  for  users.  Other  facilities  will  make  available  a  multilingual  da¬ 
ta  bank  of  terminology,  multilingual  thesauri  to  aid  searching  and  a  pilot  service  for  machine  transla¬ 
tion  of  retrieved  references  and  abstracts.  Another  programme  of  work  undertaken  by  the  Commission  will, 
when  the  necessary  policy  and  legal  considerations  have  been  completed  provide  an  electronic  document  or¬ 
der  ing  and  delivery  facility  with  stores  of  machine  readable  documents  and  high  speed  digital  facsimile 
for  their  reproduction.  In  fact  some  of  the  hosts  already  on  the  network  provide  facilities  for  on-line 
ordering  of  retrieved  documents,  the  orders  being  processed  by  'conventional’  photocopy  supply  agencies 
such  as  the  British  Lending  Library  or  the  Technische  Hochschule  at  Hannover. 


THE  FUTURE 

Under  the  Plan  of  Action  1978-1980  the  Commission  Launched  a  ‘call  for  proposals'  to  support  the 
development  of  new  information  products.  This  'call'  was  answered  by  267  proposals  and  obviously  not  all 
of  these  can  be  funded.  To-date  26  projects  have  received  approval  and  will  result  in  the  creation  of  a 
number  of  new  Information  services,  rt  is  hoped  that  in  1982  another  such  'call'  can  be  made  and  thereby 
a  continuous  flow  of  new  or  restructured  Information  products  will  become  available.  In  addition  the  Com¬ 
mission  and  C1DST  are  evaluating  the  export  potential  for  European  information  services,  initially  in 
North  America  but  with  a  view  to  later  worldwide  distribution. 

While  these  activities  aimed  at  widening  information  supply  have  been  going  on  the  technical  deve¬ 
lopments  taking  place  In  ancillary  fields  have  not  been  ignored.  The  most  interesting  of  these  are  Video¬ 
tex  or  Viewdata  services  designed  to  provide  access  to  information  via  the  telephone  and  modified  TV  sets. 
A  project  to  Interface  a  Euronet  host  and  videotex  terminals  via  Euronet  is  almost  complete  aid  a  number 
of  trials  in  areas  such  as  agricultural  Information  and  energy  saving  information  are  in  course  of  deve¬ 
lopment. 
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FORUM  DISCUSSION  -  USER/SUPPLIER  DIALOGUE 
A  summary,  prepared  by 

Rudiger  BERNHARDT  Gesellschaft  fflr  Information  und  Dokumentation  mbH, 

Herriotstrasse  5,  6000  Frankfurt  am  Main  71,  FRG 

Gerd  TITTLBACH  Fachinformatlonszentrum  Energie,  Physlk,  Mathematik 

GmbH,  Xernforschungzentrum,  7514  Eggenstein- 
Leopoldshafen  2,  FRG 


The  purpose  of  the  Forum  Discussion  was  to  provide  the  opportunity  for  an  exchange  of 
views  between  users  and  suppliers  of  information  services  hardware  and  software  with 
the  aim  of  achieving  better  cooperation  than  previously.  The  discussion  took  place 
at  the  end  of  the  Specialists'  Meeting  and  the  principal  participants  were  as  follows: 

Chairmen:  R  Bernhardt,  FRG 

H  Sauter,  USA 

Suppliers:  R  G  Cypser,  IBM,  USA 

D  Morgenstern,  Siemens  AG,  FRG 

Users:  R  Bree,  FRG 

M  Day,  USA 
B  Mahon,  Luxembourg 

To  start  the  discussion  the  two  suppliers'  representatives  gave  a  review  of  major 
hardware  and  software  developments  and  of  future  plans  which  might  influence  work  in 
the  information  field.  It  was  evident  that  over  recent  years  the  manufacturers  have 
achieved  significant  progress  in  hardware.  Computers  have  become  larger  and  more 
powerful,  and  the  capacity  of  the  stores  has  expanded  rapidly,  while  hardware  costs 
have  decreased.  Once  basic  technical  problems  and  the  legal  requirements  of  tele¬ 
communications  had  been  settled,  data  communication  networks  were  built  up  and  became 
operational.  The  reaction  of  the  manufacturers  was  to  focus  their  efforts  Increasingly 
upon  management  support  systems,  computer-computer  communications,  dialogue  systems  for 
common  use,  and  Information  retrieval  systems  combined  with  editing  and  delivery  systems. 
However,  nowadays,  office  automation  is  obviously  occupying  an  increasingly  significant 
role  in  the  market. 

Hitherto,  the  development  of  data  processing  was  driven  mainly  by  technological 
evolution  and  not  by  customer  requirements.  So  there  is  still  a  lack  not  only  of 
application  software,  but  also  basic  software. 

With  that  background  the  discussion  was  set  up.  User  requirements  on  the  one  hand  and 
the  business  interests  of  the  suppliers  on  the  other  were  described  from  various  points 
of  view.  It  became  clear  that  users  often  are  not  able  to  precisely  describe  their 
demands.  But  it  was  recognized  that  many  would-be  users  are  limited  in  their  knowledge 
of  what  it  is  reasonable  to  request,  and  that  manufacturers  are  often  either  not  aware 
of  the  real  problems  of  the  appliers  or  choose  to  ignore  requirements  which  happen  not 
to  coincide  with  their  cost/benefit  thinking.  We  are  in  an  era  where  technology  is 
leading  application.  The  user  virtually  is  always  behind  in  his  use  of  technology 
and  very  far  behind  the  leading  edge  of  technology. 

What  of  the  future?  The  new  user  generation  seems  to  be  more  information  minded  than 
its  predecessor.  User  groups  are  being  established  and  a  broad  information  and 
communication  market  is  growing.  Nevertheless,  the  situation  concerning  suppliers' 
support  should  not  be  viewed  too  optimistically.  Although  the  suppliers  confirm  that, 
there  is  a  broad  lack  of  support  in  our  application  field,  because  of  the  pressure 
of  costs  and  the  Increasing  number  of  new  application  fields  which  continually  appear. 
Suppliers  will  not  be  able  to  develop  special  hardware  and  software  tools  for  applications, 
especially  in  a  restricted  market.  Therefore,  what  the  manufacturers  will  offer  in 
future  will  be  more  generalized  solutions.  To  balance  this  trend  there  is  a  need  to 
create  strong  user  groups  to  inform  suppliers  about  specific  requirements,  indicating 
their  value  and  relevance.  It  has  to  be  realised  that,  in  the  manufacturers'  view, 
the  Information  services  field  constitutes  only  a  limited  market,  much  less  significant 
than  the  office  automation  area,  for  example.  It  follows  from  this  that  major 
developments  will  normally  occur  outside  the  information  and  documentation  area  and 
significant  suppliers'  support  can  only  be  expected  when  requirements  conform  to  the 
mainstream  requirements.  Therefore  all  the  developments  in  related  application  fields 
must  be  scrutinized  as  before,  to  evaluate  and  pick  out  ideas  and  tools  which  might  be 
transferred  and  adapted  in  the  beat  manner  possible  for  our  particular  application. 


Despite  this  rather  pessimistic  assessment,  there  is  some  hope  that  over  the  next  ten 
years  good  progress  can  be  expected  in  different  fields,  such  as: 
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-  improved  input/output  devices,  multifont  readers,  flat  colour  displays,  intelligent 
displays,  microprocessors,  small  laser  printers,  intelligent  copying  machines 

-  increased  storage  capacity,  optical  disc  memories 

-  database  management  systems  by  new  concepts 

-  information  retrieval  systems  considering  the  compilation  of  formatted  and 
unformatted  data 

-  software  improvements  in  linguistic  data  processing 

-  telecommunications  by  using  common  networks  and  in-house  communication. 


In  the  second  part  of  the  Forum  Discussion  the  audience  had  the  opportunity  of  putting 
further  questions  to  Speakers,  Panel  Members  and  other  participants.  To  summarize  the 
extended  discussions  is  very  difficult  but  the  following  points  were  in  the  centre  of 
interest: 

-  studies  of  user  requirements  and  market  research  as  a  basis  of  further  progress  In 
both  hardware  and  software 

-  cost  and  budgeting  of  information 

-  public  relations  and  marketing  of  information  services 

-  managing  the  overwhelming  flood  of  information,  redundancy  of  information,  and  the 
access  to  the  full-text  by  improved  information  analysis,  evaluation  and  verification 
methods 

-  problems  of  data  acquisition,  analysis  of  data  and  its  organization  (who  will  do  it, 
where  will  it  be  done;  how  to  reduce  and  share  the  cost) 

-  international  cooperation  and  promotion  of  information  exchange,  two-way  flow  of 
information 

-  acquisition  problems  of  scientific  and  technical  libraries  in  the  light  of 
restricted  budgets 

-  copyright  problems  involved  with  document  delivery  from  databases. 

This  second  part  of  the  discussion  showed  an  Interesting  spectrum  of  questions 
covering  common,  every-day,  but  nevertheless  hard  problems.  However,  while  discussions 
usually  cannot  solve  existing  problems  they  do  serve  to  illuminate  them  by  showing 
different  ways  -  be  they  good  or  bad  -  whereby  more  effective  solutions  can  be  sought. 
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